Sequence

<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgaaaagac ttcacaagag gttcctctta gctacgtttt gcgcgttatt aacagcaact 60 
ctccaggccg ccgatgtcac tatcactgtt aatggtcggg tagtcgctaa accctgcact 120 
attcaaacca aagaagctaa cgttaatctc ggggatcttt atacgcgcaa tctgcaacaa 180
cctggttctg catctggctg gcacaatatt actttgtcat taaccgattg tccggctgaa 240
acaagtgcag tgacggcaat cgtgacaggt tcaactgaca atacgggtta ttacaaaaat 300

gaaggtactg ccgaaaatat tcagatagag cttagggatg accaggatgc gacgttaaaa 360 
aatggcgata gcaaaacggt tattgttgat gagatcactc gtaatgcaca gtttccactt 420 
aaggcaagag ctatcacggt gaatggaaac gcaagccagg gaacgatcga ggcgctaatc 480 
aatgtgatct acaccfcggca ataa 504

<212> Type : DNA 
<211> Length : 504 

SequenceName : SEQ ID 431 

SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString : 

atgcgggcga aacttctggg aatagtcctg acaaccccta ttgcgatcag ctcttttgct 60 

tctaccgaga ctttatcgtt tactcctgac aacataaatg cggacattag tcttggaact 120 

ctgagcggaa aaacaaaaga gcgtgtttat ctagccgaag aaggaggccg aaaagtcagt 180

caactcgact ggaaattcaa taacgctgca attattaaag gtgcaattaa ttgggatttg 240 

atgccccaga tatctatcgg ggctgctggc tggacaactc tcggcagccg aggtggcaat 300

atggtcgatc aggactggat ggattccagt aaccccggaa cctggacgga tgaaagtaga 360

caccctgata cacaactcaa ttatgccaac gaatttgatc tgaatatcaa aggctggctc 420 

ctcaacgaac ccaattaccg cctgggactc atggccggat atcaggaaag ccgttatagc 480

tttacagcca gaggtggttc ctatatctac agttctgagg agggattcag agatgatatc 540 

ggctccttcc cgaatggaga aagagcaatc ggctacaaac aacgttttaa aatgccctac 600

attggcttga ctggaagtta tcgttatgaa gattttgagc taggtggcac atttaaatac 660 

agcggctggg tggaagcatc tgataacgat gagcactatg acccaggaaa aagaatcact 720 

tatcgcagta aagtcaaaga ccaaaattac tattctgttt cagtcaatgc aggttattac 780 

gtaacaccta acgcaaaagt ttatgttgaa ggcacatgga atcgggttac gaataaaaaa 840 

ggtaatactt cactttatga tcacaatgat aacacttcag actacagcaa aaatggtgca 900 

ggcatagaaa actataactt catcactact gctggtctta agtacacctt ttaa 954 

<212> Type : DNA 
<211> Length : 954 

SequenceName : SEQ ID 432 

SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString : 

ttgtttttta agcgaggaaa gattttgagt gcgggacgcc tgaataaaaa atctctgggt 60 

atcgtgatgt tgttatcggt tggactgctt ttggcgggct gttcgggtag caaatcatcc 120 

gatacaggaa cgtattccgg ctccgtttac accgtgaaac ggggggatac gctatatcgt 180 

atttcgcgca ccacgggaac cagcgtaaaa gagctggcgc gactgaacgg catttccccc 240

ccttacacca ttgaagttgg tcagaaacta aaactgggtg gggcgaaaag tagcagtagt 300

acacgtaaat caaccgccaa atcaacgacc aaaaccgcat cggttacacc gtcatcagcg 360

gtaccgaaat cttcctggcc gccagtaggg caacgttgtt ggttatggcc aacgacaggg 420 

aaagttatca tgccgtattc gacagcagat ggcggcaata aagggattga tatctcagct 480

ccacggggta cacctattta cgccgcgggt gcaggaaagg tggtgtatgt gggcaaccag 540 

ctgcgtggct acggtaatct catcatgatt aaacacagtg aagattacat tacggcttac 600

gcccataatg acacgatgct ggtaaataat gggcaaagcg tgaaggctgg gcaaaaaatc 660 

gccaccatgg ggagcacgga tgcggcatct gttcgcctgc atttccagat tcgttaccgt 720 

gcaacggcaa ttgatccgct acgttacttg ccgcctcagg gcagcaagcc aaaatgctga 780

<212> Type : DNA 
<211> Length : 780 

SequenceName : SEQ ID 433 

SequenceDescription : 
Sequence

<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgccaacac caaatcctct ggcaccggtg aaaggggccg ggaccacact gtgggtttat 60
aacgggaacg gcgacccata tgcaaacccg ctttcagaca atgactggtc gcgtctggca 120
aaggttaaag acctgacgcc cggcgaactg accgctgagt cctatgacga cagttatctc 180
gatgatgaag atgcggactg ggccgcgacc ggacaggggc agaaatccgc tggagatacc 240
agcttcacgc tggcgtggat gcccggagag caggggcagc aggcgctgct ggcgtggttt 300
aatgaaggtg atacccgtgc ctataaaatc cgcttcccga acggcacggt cgatgtgttc 360
cgcggctggg tcagcagtat cggtaaggcg gtgacggcga aggaagtgat cacccgcacg 420
gtgaaagtca ccaacgtggg acgtccgtcg atggcagaag atcgcagcac ggtaacagcg 480
gcaaccggca tgaccgtgac gcctgccagc acctcggtgg tgaaagggca gagcacgacg 540
ctgaccgtgg cattccagcc ggaaggcgca accgacaaga gcttccgtgc ggtgtctgcg 600
gataaaacaa aagccaccgt gtcggtcagt ggtatgacca tcaccgtgaa aggtgttgct 660
gcaggcaagg tcaacattcc ggtcgtatcc ggtaatggtg agtttgctgc ggttgcagaa 720
atcaacgtca ccgccagtta a 741

<212> Type : DNA
<211> Length : 741

SequenceName : SEQ ID 434
SequenceDescription :



Sequence 

<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgtctgctt tgtatgaacg ctcacagctg acgcaggtga tgatttcatc tgccccggcg 60
actgctgaaa ctatggataa ggcggaatat ctgcgcctgg actgcaccat caaggaagtg 120
cagttcaccg ccgggcagaa acaggatatt gatgtgacca cgctctgctc cacagagcag 180
gagaacatca atggtctggg ggcgtcgtct gagatttcca tgtcgggcaa tttttatctg 240
aatcaggccc agaacgccct gcgtgatgcc tatgacaatg acgcgttgta tgcgtttaag 300
gtgctgtttc cgtccggtaa gggctttaaa ttcctggcgg aagtgcgcca gcacacctgg 360
tcatccggta ccaacggcgt ggtggctgca acgttctcac tgcgtctgaa aggcaaaccg 420
gtgtcctttg tggtaccgct ggcgtttgtg aaaaatctgg ataagacact taccgtgaat 480
accggtgcgc tgctgacaat gtcagtcagt gccaacgggg gaacgccgcc gtataaatac 540
gcctggaaga aggatggtca gccggttgac gggcagacga cagacacctt cagtaagcca 600
ggtgcgcagt ccgctgatgc ggggaaatat acctgtgtgg tgaccgattc ggcagagaaa 660
gcacagagtg tgacgtctgt tgaatgcacc gtgacagtga gcgcagccgc cggataa 717

<212> Type : DNA
<211> Length : 717

SequenceName : SEQ ID 435
SequenceDescription :

Sequence



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgaaaaaga gcactctggc attagtggtg atgggcattg tggcatctgc atccgtacag 60

gccgcagaaa tatataacaa agacggtaat aaactggatg tctatggcaa agttaaagcc 120 

atgcattata tgagtgataa cgacagtaaa gatggcgacc agagttatat ccgttttggt 180 

tttaaaggcg aaacacaaat taacgatcaa ctgactggtt atggtcgttg ggaagcggag 240 

tttgccggaa ataaagcgga gagtgatact gcacagcaaa aaacgcgtct cgcttttgcc 300 

ggattgaagt ataaagattt gggttctttc gactatggcc gtaacctggg cgcgttgtat 360

gacgtggaag cctggaccga tatgttcccg gaatttggtg gcgactcctc ggcgcagacc 420

gacaacttta tgaccaaacg cgccagcggt ctggcgacgt atcggaacac cgacttcttc 480

ggcgttatcg atggcctgaa cttaaccctg caatatcaag ggaaaaacga aaaccgcgac 540

gttaaaaagc aaaacggcga tggcttcggc acgtcattga catatgactt tggcggcagc 600

gatttcgcca ttagtggggc ctataccaac tcagatcgca ccaacgagca gaacctgcaa 660

agccgtggca caggcaagcg tgcagaagct tgggctacag gtctgaaata cgatgccaat 720

aatatttatc tggcaacttt ttattctgaa acacgcaaaa tgacgccaat aactggcggc 780

tttgccaata agacacagaa ctttgaagcg gtcgctcaat accagtttga ctttggtctg 840 

cgtccatcgc tgggttatgt cttatcgaaa gggaaagata ttgaaggtat cggtgatgaa 900

gatctggtca attatatcga tgtcggggct acatattatt tcaacaaaaa tatgtcagcg 960

tttgttgatt ataaaatcaa ccaactggat agcgataaca aattgaatat taataatgat 1020

gatattgtcg cggttggcat gacctatcag ttttaa 1056
<212> Type : DNA

<211> Length : 1056

SequenceName : SEQ ID 436
SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgcgtgtca aacatgcagt agttctactc atgcttattt cgccattaag ttgggctgga 60

accatgactt tccagttccg taatccaaac tttggtggta acccaaataa tggcgctttt 120

ttattaaata gcgctcaggc ccaaaactct tataaagatc cgagctataa cgatgacttt 180

ggtattgaaa caccctcagc gttagataac tttactcagg ccatccagtc acaaatttta 240

ggtgggctac tgtcgaatat taataccggt aaaccgggcc gcatggtgac caacgattat 300

attgttgata ttgctaaccg cgatggtcaa ttgcagttga acgtgacaga tcgtaaaacc 360

ggacaaacct cgaccatcca ggtttcgggt ttacaaaata actcaaccga tttttaa 417

<212> Type : DNA
<211> Length : 417
SequenceName : SEQ ID 437

SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgaaaagaa aagttctggc aatgctggtc ccggcgttat tagttgctgg cgcagcaaat 60 
gcggctgaaa tttataataa agatggcaat aaactggatt tgtacggaaa agtagcgggc 120 
ctgcactact tctctgatga tgctagcagc gatggcgaca tgtcatatgc ccgtatcggt 180 

ttcaaaggtg aaactcagat cgctgaccaa ttcactggtt atggtcagtg ggaatttaac 240
attggcgcaa acggtcctga aagcgacaag ggtaataccg caacgcgtct ggcatttgca 300 
ggtttcggct ttggtcagaa tggtactttc gactatggtc gtaactacgg tgtcgtatat 360 
gacgtagaag catggaccga tatgctgcca gaatttggtg gagataccta tgctggcgct 420 
gacaacttca tgaacggtcg tgctaacagc gtagcaacct atcgtaacaa tggtttcttt 480 

ggtcaagttg atggtctgaa ctttgcactc cagtatcagg gtaacaacga gaaaagcgga 540
ttatttgatc aagaaggttc aggtaacggt aatggacgta aacttgctaa agagaacggc 600
gacggttcag tatgtccact tcctatgact ttgactttgg tttaa 645 
<212> Type : DNA 
<211> Length : 645 

SequenceName : SEQ ID 438

SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString : 

ttgaacacgg tgactctgga aggaggaacc ttcaataaca acggaacgct taatgacgtc 60 

gtaaaaattg agaaaaacag caacgcggta attaataaca ccggttccct gtcaacttta 120 

caacttcacg atggtacggt gaataacagc ggtattgcgt cggcgcgcgt taacgctcag 180 

ggcgatgcgg tattcaataa ccttgcaggc ggcgaggcgc gtaaaggcgc gattctctat 240

aactctgcgg tagtgaataa cgcgggtacc tggaaaatgg gttatcagga tgaaaacaac 300

aatgccggga cgctggatat tgacgataag tcaacgttca acaacagcgg caaactcatc 360

cttgataaca gcaaaaacgc cattcgcttc cagggcagca atgctaacgc cacgttatat 420 

aacaccggtg aaatgacgct ggatgccgca ttaggtgcgg gcgctattct ctacgacgat 480 

ggcgccagcg agtttattaa caagggcgtg gtggatgcga aagtcaccgt ggcggtaagt 540

actgccggtg cgacagaaag cgatgccttc ttgtggaacc aggatggcgg ggtaattaac 600

ttcgataaag acaacgccag cgcggttaaa ttcacccaca acaactatgt tgctctcaac 660 

gatggtgtaa tgaacatcag cggcaacaac gccgtggcga tggaaggcga taaaaacgca 720

cagctggtta acaacggcgt tatcaatctc ggtaccgaag gcaccaccga taccggattg 780 

actggtatgc aactggatgc caatgccacc gccgatgcgg taattgagaa caacggcacg 840

atcaatattt ttgctaacga ctcgtttgcg ttcagcgtac tgggcacaga aggtcatatc 900 

gttaacaacg gtacggtggt gattgccgac ggcgtgactg gttcggggtt gattaagcag 960 

ggcgacagcg tgaatgtgga aggggtgaac ggcaacagcg gtaacaatac cgaagtgcat 1020 

tacaccgact acacgttgcc ggacatgcca aacacttaca ccacttcacc tttcagtgaa 1080 

acgactgata gcggtagtag cgatggcagc agtaacaacc tcaacggcta tatcgtcggt 1140

accaacgttg acggcagcgc cggtaaactg aaggtcaaca acgccagcat gaatggtgtc 1200 

gggatcaaca ccggtttcgc cgccggtacg gcagacacca cggtcagttt cgacaacgtg 1260 
gtggaaggca tcaacctgac cgacgccgat gccatcacct caacgtccgt ggtatggacc 1320

gccaaaggca gcaccgatgc cagcggcaac gttgacgtca tcatgagcaa aaacgcctac 1380

accgatgtgg cgaccgatgc ttcggtgaac gatgtggcga aggcactgga tgcgggttac 1440

accaataacg agctgtatac cagcctgaac gtgggcacca ctgctgaact gaatagcgcc 1500

ctgaagcagg tgagcggtag ccaggcgacc acggtattcc gtgaagcgcg tgtgttaagc 1560

aaccgcttca gcatgctggc ggatgccgcg ccgaaggtgg gcaatggcct ggcctttaac 1620

gtggtggcga aaggtgaccc gcgtgcggaa ctcggaaata acaccgagta tgacatgctg 1680

gcactgcgta aaaccgttga cctgagcgaa agccagagca tgagcctgga atacggtatc 1740

gcgcgtctgg atggcgacgg tgcgcagaaa gcgggcgaca atggcgtaac cggcggctac 1800

agccagttct ttggcctgaa gcaccagatg tccttcgaca atggtatgcg ttggaacaac 1860

gcgctgcgtt atgacgtgca taatctcgac agcagccgct cggtcgctta cggcgacgtc 1920

agcaaaacgg cggatacgga tgtgaaacag cagtacctgg agttgcgtag cgaaggggcg 1980

aaaacctttg agccgcgcga agggctgaaa atcaccccgt acgccggagt gaaactgcgt 2040

cactcgctgg aaggcggcta tcaggagcgc aatgccggag actttaacct gagcatgaac 2100

agcggcagcg aaacggcggt ggacagcatc gtcgggctga aactggacta cgcagggaaa 2160

ggcggctgga gcgcgaatgc gacgctggaa ggcgggccga acctgagcta cagcaagagc 2220

cagcgcacgg caagccttgc aggggcaggc agccagcact ttaacgtcga tgacggtcag 2280

aagggcggcg gtatcaacag cctggcgagc gtcggcgtga agtacagtag caaagaaagt 2340

tcgctgaatc tggatgcgta tcactggaaa gaggacggca tcagcgacaa aggcgtgatg 2400

ctgaacttta agaaaacgtt ctaa 2424

<212> Type : DNA
<211> Length : 2424

SequenceName : SEQ ID 439
SequenceDescription :

Sequence



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgcttaatg gaattagtaa cgctgcttct acactagggc ggcagcttgt aggtatcgca 60

agtcgagtga gctctgcggg gggaactgga ttttctgtag cccctcaggc cgtgcgtctt 120 

actccggtga aagttcattc ccctttttct ccaggctcgt cgaatgttaa tgcgagaacg 180 

atttttaatg tgagcagcca ggtgacttca tttactccct ctcgtccggc accgccgcca 240 
ccgacaagtg gacaggcatc cggggcatcc cgacctttac cgcccattgc acaggcatta 300

aaagagcact tggctgccta tgaaaaatcg aaaggtcctg aggctttagg ttttaagccc 360

gcccgtcagg caccgccgcc accgacaagt ggacaggcat ccggggcatc ccgaccttta 420

ccgcccattg cacaggcatt aaaagagcac ttggctgcct atgaaaaatc gaaaggtcct 480

gaggctttag gttttaagcc cgcccgtcag gcaccgccgc caccgacaag tggacaggca 540

tccggggcat cccgaccttt accgcccatt gcacaggcat taaaagagca cttggctgcc 600

tatgaaaaat cgaaaggtcc tgaggcttta ggttttaagc ccgcccgtca ggcaccaccg 660

ccaccgacag ggcctagtgg actaccgccc cttgcacagg cattaaaaga tcatttagct 720 

gcctatgagc aatcgaagaa agggtaa 747 
<212> Type : DNA 
<211> Length : 747 

SequenceName : SEQ ID 440

SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgaacaaga agattcattc cctggccttg ttggtcaatc tggggattta tggggtagcg 60

caggcacaag agccgaccga tactcctgtt tcacatgacg atactattgt cgttaccgcc 120

gccgagcaga acttacaggc gcctggcgtt tcgaccatca ccgcagatga aatccgcaaa 180

aaccccgttg cccgcgatgt ttcggagatc atccgtacca tgccaggcgt taacctgacc 240

ggtaactcca ccagtggtca gcgagggaat aaccgacaga ttgatattcg cggtatgggt 300

ccggaaaaca cgctgatttt gattgacggc aagccggtaa gcagccgtaa ctcggtgcgt 360

cagggctggc gtggcgagcg cgatacccgt ggtgatacct cctgggtgcc acctgaaatg 420

attgaacgta ttgaagttct gcgtggtccg gcagctgcgc gttatggcaa cggcgcggcg 480

ggcggcgtgg ttaacatcat taccaaaaaa ggcagcggcg agtggcacgg ctcctgggac 540

gcatatttca atgcgccaga acataaagag gaaggtgcca ccaaacgcac taactttagc 600

ctgaccggtc cgctgggcga cgaattcagc ttccgcttgt atggcaacct cgacaaaacc 660

caggctgacg cgtgggatat caaccagggt catcagtccg cgcgtgccgg aacgtatgcc 720

acgacgttac cagccgggcg cgaaggggtg atcaataaag atattaatgg cgtggtgcgc 780

tgggacttcg caccattgca atcgctggaa ctggaagcgg gttacagccg ccagggtaac 840

ctgtatgcgg gcgataccca gaacaccaac tctgacgctt acactcgctc gaaatatggc 900

gatgaaacca accgcctgta tcgccagaac tactcgctga cctggaacgg tggctgggat 960

aacggcgtga ccaccagcaa ctgggtgcag tacgaacaca cccgtaactc gcgtattccg 1020

gaaggtctgg cgggcggtac cgaagggaaa tttaacgaaa aagcgacaca ggatttcgtc 1080 

gafcaacgatc ttgatgacgt gatgctgcat agcgaagtta acctgccgat tgatttcctc 1140 

gtfcaaccaga cgctgacgct gggtacggag tggaatcagc agcggatgaa ggacttaagt 1200 

tccaacaccc aggcactgac cggaacgaat accggtggcg ctattgatgg cgtgagtgcc 1260 

accgaccgta gcccgtattc aaaagcagaa attttctcgc tgtttgccga aaacaacatg 1320

gagctgactg acagcaccat cgtaacgccg gggctgcgtt tcgatcatca cagtattgtc 1380

ggcaataact ggagcccggc gctgaacata tcgcaaggtt taggcgatga cttcacgctg 1440 

aaaatgggca ttgcccgcgc ctataaagcg ccgagcctgt accagactaa cccgaactac 1500 

attctctaca gtaaaggtca gggctgctafc gccagcgcgg gcggctgcta tctgcaaggt 1560 

aacgatgacc tgaaagcaga aaccagcatc aacaaagaga ttggtctgga gttcaaacgc 1620 

gacggctggc tggcgggcat cacctggttc cgtaacgatt atcgcaataa gattgaagca 1680 

ggctatgttg ctgtagggca aaacgcagtc ggcaccgatc tctatcagtg ggataacgtg 1740 

ccgaaagcgg tggttgaagg tctggaagga tcgttaaacg taccggfctag cgaaapggtg 1800

atgtggacta ataacatcac ttatatgctg aagagtgaaa acaaaaccac gggcgaccgt 1860

ttgtcgatca tcccggagta tacgttgaac tcaacgctga gctggcaggc acgagaagat 1920

ttgtcgatgc aaacgacctt cacctggtac ggtaagcagc agccgaagaa gtacaactat 1980

aaaggtcagc cagcggfctgg accggaaacc aaagaaatta gtccttacag cattgttggg 2040 

ctcjagcgcga cgtgggatgt gacgaagaat gtcagtctga ccggcggcgt ggacaatctg 2100 

ttcgacaaac gtttgtggcg tgcgggtaat gcccagacca cgggcgattt ggcaggggcc 2160 

aactatatcg ccggtgccgg ggcgtatacc tataacgagc cgggacgtac gtggtatatg 2220

agcgtaaaca ctcacttctg a 2241 
<212> Type : DNA
<211> Length : 2241

SequenceName : SEQ ID 441 

SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

ttgggagggc gatttagtct caggtacaaa aaactctcat atagattcgt ttttctgacc 60
ttggcaggtt gttcttcagt aggcaaccag tcattgaaaa atgagacgca ggaaagtgtg 120

aaaaccaaaa ttgttaaagg caaaactaca aaacaggacg tgttagcatc gttcggtgaa 180 
cctgacagcc gttctttgat cgatggtgaa gaacaatggt catacactat gtataacagc 240 
cagtccaaag caacctcttt catccccgtt gtgggactgc ttgcaggtgg cgcagactca 300 
caaactaaat ctctgacagt ttctttcaaa ggcgaaaaag tcagcacata catctttaat 360 
gctggaacaa gcaacgtgaa gactggcatt ttttag 396 
<212> Type : DNA
<211> Length : 396

SequenceName : SEQ ID 442 

SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgaaaaaaa ttgcatgtct ttcagcactg gccgcagttc tggctttcac cgcaggtact 60 

tccgtagctg cgacttctac cgtaactggc ggttacgcac agagcgacgc tcagggccaa 120 

atgaacaaaa tgggcggttt caacctgaaa taccgctatg aagaagacaa cagcccgctg 180

ggtgtgatcg gttcattcac ttacaccgag aaaagccgta ctgcaagctc tggtgactac 240 

aacaaaaacc agtactacgg catcactgct ggtccggctt accgcattaa cgactgggca 300

agcatctacg gtgtagtggg tgtgggttat ggtaaattcc agaccactga atacccgacc 360

tacaaacacg acaccagcga ctacggtttc tcctacggtg ctggtctgca gttcaacccg 420

atggaaaacg ttgctctgga cttctcttac gagcagagcc gtattcgtag cgttgacgta 480 

ggcacctgga ttgccggtgt tggttaccgc ttctaa 516 
<212> Type : DNA
<211> Length : 516

SequenceName : SEQ ID 443 

SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgaagagta tagcaacact ggttgtgtgt gcaatctccg ggattgcctg tgtaaattta 60 
tctgcacatg cagcagaagg agagcataca atttctctgg ggtatgcgca ctttcagttt 120
ccgggactga aggattttgt aaaggatgcg actgctcata acagggagac tttcagtcat 180

ttcgtcaaca gaaactactt ttcttcattg ggcgaatata cagatggtcg ggtcagtgga 240 

tatgaaggca aggataaaaa tccacagggc attaatatca ggtatcgcta cgagataacg 300

gatgattttg gcgttatcsic ctcttttaca tggacgcgtt ctctcactaa ctcacagaca 360

tttattgatg tgcagtcacjc cgatcatacc aggaagatta agaatccggc agcttctgcc 420

agaacggata tcagggcgaa ttactggagt ctgttagcgg ggccttcatg gcgggttaat 480

cagtacatga gtttatatgjc gatggcaggg atgggcgttg ctaaagttag cgctgacctg 540 

aaaattaagg acaatattaa cagtagtggc ggattttctg aaagcaacag cacgaaaaaa 600 

acctcccttg cgtgggctgc aggtgcacag tttaacctga atgagagtgt tacactggat 660 

gtggcttacg aaggttccgg ctctggcgac tggcgcacga gtggcgttac tgctggcatt 720

ggcctgaaat tctga 735 
<212> Type : DNA 
<211> Length : 735

SequenceName : SEQ ID 444

SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgcgtaaac tttatgccgc cattttgtcc gcagccattt gtctgaccgt atccggtgcg 60 

cctgcatggg cgtctgagca gcaggccacg ctgagcgcgg ggtatcttca tgtctcgacg 120 

aacgctcccg gtagcgataa tcttaacggg attaacgtga aataccgtta tgaattcacg 180 

gacacgctgg ggctggtgaic gtcattcagc tatgcaggag acaggaatcg ccagattacc 240 

cgttacagcg atacccgctg gcatgaagat tccgtgcgta accgctggtt cagcgtaatg 300

gcggggccgt ctgtgcgcgft gaatgaatgg ttcagcgcgt atgcgatggc gggagtggct 360

tacagccgtg tgtcgacttt ctccggggat tatctccgcg taactgacaa caaggggaaa 420 

acgcacgatg tgctgacccgg aagtgatgac ggtcgccaca gcaacacgtc tctggcgtgg 480 

ggggctggcg tgcagtttaa cccgaccgaa tccgtggcca ttgatattgc ttatgaaggc 540 

tccggcagtg gcgactggcg cactgacggt ttcatcgtgg gtgtcggtta taagttctga 600

<212> Type : DNA
<211> Length : 600

SequenceName : SEQ ID 445
SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgcgtaaac tttatgcccjc cattttgtcc gcagccatct gtctggccgt atccggcgcg 60 

cctgcatggg cgtctgagca gcaggccacg ctgagcgcgg ggtatcttca tgcccggacg 120 

agcgctcccg gtagcgataa tcttaacggg attaacgtga aataccgtta tgaatttacg 180 

gacacgctgg ggctggtgsc gtcattcagc tatgcaggag acaagaatcg ccagcttacc 240 

cgttacagcg atacccgctg gcatgaagat tccgtgcgta accgctggtt cagcgtaatg 300

gcggggccgt ctgtgcgcgt gaatgaatgg ttcagcgcgt atgcgatggc gggtgtggct 360

tacagccgtg tgtcgacttt ctccggggat tatcttcgcg taactgacaa caaggggaaa 420 

acgcacgatg tgctgaccgg aagtgatgac ggtcgccaca gcaacacgtc tctggcgtgg 480 

ggggctggcg tgcagtttaa cccgaccgaa tccgtggcca ttgatattgc ttatgaaggc 540

tccggcagtg gcgactggcg cactgacggt ttcatcgtgg gtgtcggtta taagttctga 600

<212> Type : DNA
<211> Length : 600

SequenceName : SEQ ID 446
SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgcgtaaac tttatgccgc cattttgtcc gcagccatct gtctggccgt atccggcgcg 60 

cctgcatggg cgtctgagca gcaggccacg ctgagcgcgg ggtatcttca tgcccggacg 120 

agcgctcccg gtagcgataa tcttaacggg attaacgtga aataccgtta tgaatttacg 180

gacacgctgg ggctggtgsc gtcattcagc tatgcaggag acaagaatcg ccagcttacc 240 

cgttacagcg atacccgctg gcatgaagat tccgtgcgta accgctggtt cagcgtaatg 300

gcggggccgt ctgtgcgcgt gaatgaatgg ttcagcgcgt atgcgatggc gggtgtggct 360 

tacagccgtg tgtcgacttt ctccggggat tatcttcgcg taactgacaa caaggggaaa 420
acgcacgatg tgctgaccgg aagtgatgac ggtcgccaca gcaacacgtc tctggcgtgg 480
ggggctggcg tgcagtttaa cccgaccgaa tccgtggcca ttgatattgc ttatgaaggc 540 
tccggcagtg gcgactggcg cactgacggt ttcatcgtgg gtgtcggtta taagttctga 600 

<212> Type : DNA 
<211> Length : 600 

SequenceName : SEQ ID 447 

SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgcgtaaac tttatgccgc cattttgtcc gcagccatct gtctggccgt atccggcgcg 60

cctgcatggg cgtctgagca gcaggccacg ctgagcgcgg ggtatcttca tgcccggacg 120 

agcgctcccg gtagcgataa tcttaacggg attaacgtga aataccgtta tgaatttacg 180 

gacacgctgg ggctcrgtgac gtcattcagc tatgcaggag acaagaatcg ccagcttacc 240 

cgttacagcg atacccgctg gcatgaagat tccgttcgta accgctggtt cagcgtaatg 300

gcggggccgt ctgtgcgcgt gaatgaatgg ttcagcgcgt atgcgatggc gggtgtggct 360

tacagccgtg tgtcgacttt ctccggggat tatcttcgcg taactgacaa caaggggaaa 420 

acgcacgatg tgctgaccgg aagtgatgac ggtcgccaca gcaacacgtc tctggcgtgg 480 

ggggctggcg tgcagtttaa cccgaccgaa tccgtggcca ttgatattgc ttatgaaggc 540 

tccggcagtg gcgactggcg cactgacggt ttcatcgtgg gtgtcggtta taagttctga 600

<212> Type : DNA
<211> Length : 600

SequenceName : SEQ ID 448

SequenceDescription : 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atggtcatga gccagaaaac cctgtttaca aagtctgctc tcgcagtcgc agtggcactt 60

atctccaccc aggcctggtc ggcaggcttt cagttaaacg aattttcttc ctctggcctg 120 

ggccgggctt afctcragggga aggcgcaatt gccgatgatg caggtaacgt cagccgtaac 180 

cccgcattga ttaccatgtt tgaccgcccg acattttctg cgggtgcggt ttatattgac 240 

ccggatgtaa atafccagcgg aacgtctcca tctggtcgta gcctgaaagc cgataacatc 300

gcgcctacgg catgggttcc gaacatgcac tttgttgcac cgattaacga ccaatttggt 360 

tggggcgctt ctattacctc taactatggc ctggcaacag agtttaacga tacttatgca 420

ggcggctctg tcgggggtac aaccgacctt gaaaccatga acctgaactt aagcggtgcg 480

tatcgcttaa ataatgcatg gagctttggt cttggtttca acgccgtcta cgctcgcgcg 540

aaaattgaac gtttcgcagg cgatctgggg cagctggttg ctggtcagat tatgcaatct 600 

cctgccggga agactcctca agggcaagca ttggcagcta ccgccaacgg tatcgacagt 660

aataccaaaa tcgctcatct gaacggcaac cagtggggct ttggatggaa cgccggtatc 720

ctgtatgaac tggataaaaa taaccgctat gcactgacct accgttctga agtgaaaatt 780

gacttcaaag gtaactacag cagcgatctt aatcgtgtgt ttaataacta cggtttgcca 840

attcctaccg ccacaggtgg cgcaacgcaa tcgggttatc tgacgctgaa cctgcctgaa 900

atgtgggaag tgtcgggtta taaccgtgtt gatccgcagt gggcgattca ctatagcctg 960 

gcttacacca gctcjgagtca gttccagcag ctgaaagcga cctcaaccag tggcgacacg 1020

ctgttccaga aacatgaagg ctttaaagat gcttaccgca tcgcgttggg taccacttat 1080 

tactacgatg ataactggac cttccgtacc ggtatcgcct ttgatgacag cccagttccg 1140 

gcacagaatc gttctatctc cattccggac caggaccgtt tctggctgag tgcaggtacg 1200 

acttacgcgt ttaataaaga tgcttcagtc gacgttggtg tttcttatat gcacggtcag 1260 

agcgtgaaaa ttaacgaagg cccataccag ttcgagtctg aaggtaaagc ctggctgttc 1320 

ggtactaact ttaactacgc gttctga 1347 
<212> Type : DNA 
<211> Length : 1347 

SequenceName : SEQ ID 449

SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atggcctttt ctcaagcggt fcagcggafcta aacgctgccg ccaccaacct cgatgttatt 60 
ggcaacaata tcgccaactc cgccacctac ggctttaaat caggcacggc ctcttttgcc 120 
gatatgtttg ccggttcgaa agtgggactg ggggtaaaag ttgccggtat cactcaggac 180
tttaccgatg gcacgaccac caacaccggg cgtggtctgg acgttgctat cagccagaac 240
ggttttttcc gtctggtaga tagcaacggt tcggtgttct acagccgtaa cggacaattt 300
aagctggatg aaaatcgtaa cctggtgaat atgcaaggtt tacagctgac gggttacccg 360
gcaaccggta cgccgccgac tattcagcaa ggggcgaatc cgactaacat ttcgatcccg 420
aataccctga tggcagcgaa aactaccacc acggcgtcga tgcagatcaa cctgaattcc 480
agcgatccgc ttccctctgt taacgcattt gatgccagca atgcggatag ctataacaaa 540
aaaggttcgg tgactgtttt cgacagtcag ggtaatgctc atgacatgag cgtctacttt 600
gtgaagaccg gggataataa ctggcaggtc tacacccagg atagcagtga tccaacaggt 660
acagccgagc ctgcaatgaa gctggtgttt aatgccaatg gcgttctgac ctcaaatcca 720
acagagaata ttaccaccgg cgcaattaac ggcgcagaac ccgccacgtt tagcctgagc 780
ttcctcaact ccatgcagca aaataccggc gctaacaaca ttgtggcaac cacccagaat 840
ggctacaaac cgggcgatct ggtgagttat caaatcaatg atgacggtac ggttgtcggc 900
aactattcca acgaacaaac ccaactgctg gggcagattg tactggcgaa ctttgccaac 960
aacgaaggtc tggcatcccja aggcgacaac gtctggtctg cgacgcaatc ttctggcgtg 1020
gcgctgttgg ggacagccgg gacgggcaac tttggcaccc tgaccaacgg tgcgttggaa 1080
gcgtccaacg tcgatctcag taaagaactg gtcaatatga tcgttgccca gcgtaactat 1140
cagtctaacg cccagaccat caaaacccag gaccagatcc tcaacacgct ggttaactta 1200
cgctaa 1206

<212> Type : DNA
<211> Length : 1206

SequenceName : SEQ ID 450
SequenceDescription :

Sequence

<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgtctaaat cgactttttt acacattctg attagctcca tcatattggt ggcactaata 60
caatcatccg cttgggctaa ctgtacaaac acacagatag gtcaaactga agatggtcgg 120
acagcgctta ttgagttcgjg aaaaattaat atgaccgaca cctattttgc gccagcaggt 180
tcactcctcg cgacaacgcyt cgtacctccc actaactaca catcaggtgg cgcgacagga 240
agctccgtat tgtgggaatg tgatgcaaca gatttgccaa acatctattt tctggtcgct 300
accaatggcg acgaccgccjt ggggggcttt tacgatgcag gcgggcctga tggtctgagt 360
gatgfccfcatg ccacctggtt cgcfctttgfcc ggtctcaagc agaccatggc gggcgtgacg 420
cttggtcgtt actggaagaa agtgcccatc accagttatg ccactcaggg aactaaaatc 480
cagattcgct tacaggatat ccctcctctt catgctgagc tttatcgcat cagtacgcta 540
cctgatacat cagcaacaac aagttggtgc ggtaataata atacagatag tagtggagtc 600
ggattcgcaa aaccttccgg tacaatctat aactgtgfctc agcccaatgc ctatattcag 660
ctttccggta ccagcggcat tttatttggt catgatgagc ccggcgaaga tagttctgtt 720
cattgggatt tctggggfcgc tgataatggt tttggttacg gaatgcgttc ggccaatcga 780
ctctacaaca atgccacctg cgttgcccgc agcgccacgc cgttagtatt gctgccgaca 840
attgcagaag cacaactgaa tgcgggcatg gaaagtaccg gtaattttaa tgtccgcgtc 900
gagtgtagta actcggttca atcagggatt agcgatactc agacagcatt aggaatccag 960
gtgtctgaag gtgcatafcac agcggcgcaa aaactgggga ttatcaatag caacggcggc 1020
gtcagcgccc tggtctctcja taattatgac gcagcagaga tggcaaaggg cgttgggatc 1080
tacatttcta acagtgctca ccccgatacg gcgatgacgc tggttggtca accgggcatc 1140
gcgaagttaa cccccggagg aaatgcagcg gggtggtatc ctgtatttga aggggcaaca 1200
ttagaaggtg cgactcaccc cggatactcc agctatagtt actcttttat cgcccggttg 1260
aagaaactgc caaatcagac agtcagtgcg ggaaaagtgc gggcaacggc ttatatattg 1320
gtgaaaatgc aatga 1335

<212> Type : DNA
<211> Length : 1335

SequenceName : SEQ ID 451

SequenceDescription :

Sequence

<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atggaaaaca atcgcaattt ccctgccaga caatttcatt cgctcacgtt ctttgccggt 60

ctttgtattg gcatcacgcc tgtggctcag gcactcgccg ccgaagggca aactaacgcg 120

gatgacacgc tggttgtcga agcatcaacg ccttcgcttt atgcgccaca acaatctgcc 180

gatccgaaat tctcgcgtcc ggtagcggat actacccgca cgatgacggt gatttctgaa 240

caagtgatta aagatcaggg cgcaaccaac cttaccgatg cgctcaaaaa cgtccccggc 300

gtgggtgcgt tttttgcggg tgagaacggt aactccacca ctggcgacgc catttatatg 360

cgcggtgctg atacctctaa cagtatttat attgatggca ttcgcgatat cggcagcgtc 420

tcgcgcgaca ccttcaatac tgagcaggtc gaagtgatta aagggccttc cggcaccgac 480

tacgggcgca gcgcgccgac gggctcgatc aatatgatca gcaagcagcc gcgcaatgat 540 

tccggcattg acgcctccgc cagtattggc agcgcctggt tccgccgcgg cacgctggac 600

gtcaatcagg tcattggtga taccactgcg gtgcgcctga atgtgatggg cgaaaaaacg 660 

cacgatgccg gacgcgacaa agtcaaaaat gagcgttacg gcgtcgcccc ttctatcgct 720

tttggccttg gtacagcgaa tcgtttgtat cttaattatc tgcatgtcac ccagcacaac 780

acgccagacg gcggcattcc aaccatcggt ttgccgggct attctgcccc atctgcagga 840 

acggcgaccc tgaatcattc cggaaaagtt gatacgcata acttttacgg cacggatfccc 900

gattacgacg attcgaccac cgacaccgcc accatgcgtt ttgagcacga tatcaacgat 960 

aacaccacca ttcgcaacac tacccgttgg tcgcgcgtga agcaggatta cctgatgacg 1020

gcgattatgg gtggggcgtc gaatattacc cagcccacca gcgatgtgaa tagctggacg 1080 

tggtcacgca cggcgaatac caaagatgtg agtaataaaa ttctcaccaa ccagaccaac 1140 

ctgacctcga cattctatac cgcttctatc ggtcatgatg tcagtaccgg cgtggaattt 1200 

acccgtgaaa cgcagactaa ctacggcgtt aatccggtga cgttacctga ggtaaatatt 1260 

tatcatcctg acagcagcat tcatcccggc ggcctgacgc gcaacggcgc aaacgccaat 1320

ggtcagacgg ataccttcgc aatttacgca ttcgatacgc tgcaaatcac ccgtgatttt 1380

gagctgaacg gcgggatccg tctggataat tatcatactg aatatgacag tgccaccgcc 1440 

tgcggcggca gcggacgcgg tgccatcacc tgcccagctg gtgtggcaaa aggttctccg 1500

gtcaccaccg tcgacaccgc caagtcgggc aatctggtga actggaaagc cggggcgctg 1560 

tatcacctga cggaaaacgg caatgtctat attaactatg ccgtttccca gcagcctccg 1620

ggcggcaaca acttcgccct tgcgcagtct cjgcagcggta acagtgccaa ccgcaccgat 1680 

tttaaaccgc aaaaagccaa caccagcgag attggcacca aatggcaggt tctggataaa 1740 

cgcctgttgc tcaccgccgc gctgttccgt actgatatcg aaaatgaagt tgagcaaaat 1800

gatgacggga cttactcgca atacggtaag aaacgcgtcg aaggctatga gatatccgtg 1860

gcagggaata tcactcccgc gtggcaggtg attggcggct atacccagca aaaagcaacc 1920

atcaaaaacg gcaaagatgt tgcccaggat ggttcctcat cgctgccgta taccccggag 1980 

cacgccttca ccttatggag ccaatatcag cjcaaccgatg atatctctgt tggcgcgggc 2040 

gcacgctata tcggcagtat gcataaaggt -bcagacggcg cggtgggaac gccagcgttt 2100

accgaaggtt actgggtcgc cgatgccaaa ctggggtata gggttaafccg caatctcgac 2160 

ttccagcfcaa acgtctacaa cctgfcttgat accgattacg tcgcctcaat caataagagc 2220

ggctaccgtt atcacccggg cgagccaaga accttcttgc tcacagccaa tatgcatttc 2280 

tga 2283
<212> Type : DNA
<211> Length : 2283

SequenceName : SEQ ID 452

SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgcaaatga agaaattgct ccccattctt atcggcctga gcctttctgg gttcagttcg 60 

ttgagccagg ccgagaacct gatgcaagtt tatcagcaag cacgccttag taacccggaa 120

ttgcgtaagt ctgccgccga tcgtgatgct gcctttgaaa aaattaatga agcgcgcagt 180

ccattactgc cacagctagg tttaggtgca cjattacacct atagcaacgg ctaccgcgac 240

gcgaacggca tcaactctaa cgcgaccagt cjcgtccctgc agttaactca atccattttt 300

gatatgtcga aatggcgtgc gttaacgctg caggaaaaag cagcagggat tcaggacgtc 360

acgtatcaga ccgatcagca aaccttgatc ctcaacaccg cgaccgctta tttcaacgtg 420

ttgaatgcta ttgacgttct ttcctataca caggcgcaaa aagaagcgat ctaccgtcaa 480

ttagatcaaa ccacccaacg ttttaacgtg ggcctggtag cgatcaccga cgtgcagaac 540

gcccgcgcgc agtacgatac cgtgctggcg aacgaagtga ccgcacgtaa taaccttgat 600

aacgcggtag agcagctgcg ccagatcacc ggtaactact atccggaact ggcggcgctg 660 

aatgtcgaaa actttaaaac cgacaaacca cagccggtta acgcgctgct gaaagaagcc 720 

gaaaaacgca acctgtcgct gttacaggca cgcttgagcc aggacctggc gcgcgagcaa 780

attcgccagg cgcaggatgg tcacttaccg actctggatt taacggcttc tagcgggatt 840

tctgacacct cttatagcgg ttcgaaaacc cgtggtgccg ctggtaccca gtatgacgat 900

agcaatatgg gccagaacaa agttggcctg agcttctcgc tgccgattta tcagggcgga 960 

atggttaact cgcaggtgaa acaggcacag tacaactttg ttggtgccag cgagcaactg 1020 

gaaagcgcgc atcgtagcgt cgtgcagacc cjtacgttcct ctttcaacaa cattaatgct 10 80 

60 tctatcagta gtattaacgc ctacaaacaa grccgtagttt ccgctcaaag ctcattagac 1140 

gcgatggaag cgggctactc ggtcggtacg cgtaccattg ttgatgtgtt ggatgcgacc 12 0 0 

accacgctgt acaacgccaa gcaagagctg gcgaatgcgc gttataacta cctgattaat 12 60 

cagctgaata ttaagtcagc cctgggtacg ttgaacgagc aggatctgct ggcactgaac 13 20 

aatgcgctga gcaaaccggt ttccactaat ccggaaaacg ttgccccgca aacgccggaa 13 80 

65 cagaatgcta ttgctgatgg ttatgcgcct gatagcccgg cacccgtcgt tcagcaaaca 1440 

tccgcacgca ctaccaccag taacggtcat aaccctttcc gtaactga 1488 
<212> Type : DNA 
<211> Length : 1488

SequenceName : SEQ ID 453
SequenceDescription :

Sequence



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

gtgaccaaac tcaaacttct ggcacttgga gtgcttatcg caacgtctgc aggcgtagcg 60

cacgctgaag gtaaattttc cctgggcgca ggcgtaggtg tcgttgaaca cccatataaa 120

gattacgata ccgatgttta cccagtaccg gtaatcaact atgaaggcga taacttctgg 180

ttccgtggct taggtggtgg ttactacctg tggaatgacg caacggataa actttcaatt 240

accgcttact ggtcgccgct ttacttcaaa gctaaagaca gtggcgatca ccaaatgcgt 300

cacctggatg accgtaagag cacaatgatg gctggtctgt cttatgctca ctttacccag 360

tacggttacc tgcgtaccac cctggctggc gataccctgg ataacagcaa cggcatcgtc 420

tgggatatgg cctggttgta tcgttacacc aacggtggcc tgaccgtgac tccgggtatt 480

ggtgtgcagt ggaacagcga aaaccagaac gaatactatt atggcgtatc gcgcaaagag 540

tccgctcgca gcggtctgcg tggctataac tcgaacgaca gctggagccc ttacctggag 600

ctgagcgcca gctacaactt cctcggcgac tggagtgttt acggtaccgc gcgctacacc 660

cgtctgtctg atgaagttac tgacagcccg attgtggata aatcctggac tggcctgatt 720

tctaccggga tcacctacaa attctga 747
<212> Type : DNA 
<211> Length : 747 

SequenceName : SEQ ID 454
SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli 0157:H7 

<400> PreSequenceString :

atgaaaaaaa cattactggc agccggtgcg gtactggcgc tctcttcgtc ttttactgtc 60

aacgcagctg aaaacgacaa accgcagtat ctttccgact ggtggcacca gagcgttaac 120

gttgtcggaa gctatcacac ccgfcttcgga ccgcagatcc gcaacgatac ctaccttgag 180

tacgaagcat tcgctaaaaa agactggttc gacttctatg gttatgcgga tgcgccggta 240

ttcttcggcg gtaactccga tgcaaaaggt atctggaacc acggttctcc tctgtttatg 300

gaaatcgaac cacgtttctc catcgacaag ctgaccaata ctgaccttag cttcggtccg 360

ttcaaagagt ggtacttcgc gaacaactac atttacgaca tgggtcgtaa taaagatggt 420

cgccagagca cctggtacat gggtctgggt accgatatcg acactggcct gccgatgagc 480

ctgtccatga acgtctatgc gaaataccag tggcagaact atggcgcagc gaacgaaaac 540

gagtgggacg gttaccgttt caaaattaaa tactttgtgc cgattaccga tctgtggggc 600

ggtcagctga gctacatcgg cttcaccaac ttcgactggg gttccgattt aggggatgac 660

agcggtaacg caatcaacgg tattaagacc cgtactaata actctatcgc ttccagccat 720

attctggctc tgaactacga tcactggcac tactctgtcg tagctcgtta ctggcacgac 780

ggtggtcagt ggaacgacga tgcagaactg aacttcggca acggcaactt caacgttcgc 840

tctaccggct ggggtggtta cctggtagta ggttacaact tctga 885
<212> Type : DNA
<211> Length : 885

SequenceName : SEQ ID 455
SequenceDescription :

Sequence

<213> OrganismName : Escherichia coli 0157:H7
<400> PreSequenceString :

atgctttcaa cacaatttaa cagggataat caatatcaag ccatcaccaa accgtcacta 60

cttgccggtt gcatagcact ggcactatta ccttctgccg cttttgctgc accagccact 120

gaagaaacgg tgattgttga gggjttcagcc acagctccag atgatggcga aaatgattac 180

agcgtaacgt ctacctctgc gggtaccaaa atgcagatga ctcaacgtga tattcctcag 240

tcggtcacta ttgttagcca gcagcggatg gaagatcagc agttacaaac gctgggcgaa 300

gtgatggaaa acacgctggg gatcagcaaa agtcaggcgg attccgatcg tgctctttat 360

tattcccgcg gattccagat cgataactat atggttgatg gtatccccac ctattttgaa 420

tcgcgctgga atctgggcga cgcactttct gatatggcac tgtttgaacg cgtagaagta 480

gtgcgtggcg cgacaggact catgaccggg acgggtaatc catctgcggc aattaatatg 540

gttcgaaaac acgcgaccag tcgtgaattt aaaggcgatg tctcggcgga atacggtagc 600

tggaacaaag aacggtatgt ggcggattta caaagcccac tcaccgaaga cggtaaaatc 660

cgcgcgcgaa ttgtcggcgg ctaccagaat aacgactcat ggctggaccg ctacaatagt 720

gaaaagacct tcttctccgg cattgtcgat gctgatttag gcgatcttac gacgctttca 780
gccggttacg aatatcagcg cattgatgtt aatagcccaa cctggggcgg tttaccgcgc 840

tggaatactg atggcagcag caacagttac gatcgcgcac gcagtaccgc acctgactgg 900

gcgtacaacg ataaagagat caacaaggtc tttatgaccc tgaagcagcg gtttgctgat 960

acctggcaag cgacactgaa tgccacccac tctgaagtcg aatttgacag caaaatgatg 1020

tatgtcgatg cctatgtaaa caaagcggat ggtatgctgg ttgggccata cagtaattat 1080

ggacctggct ttgattatgt cggcggcacc ggttggaaca gtggcaaacg taaagttgat 1140

gcgctggatt tgttcgctga cggtagttat gaattgtttg gfccgtcagca caatctaatg 1200

tttggtggca gttacagcaa acaaaacaat cgttacttca gttcatgggc caacatcttc 1260

ccggatgaaa ttggcagttt ctacaacttt aatggcaatt tcccacaaac cgactggtca 1320

ccacagagcc tggcgcagga cgataccaca catatgaaat cgttatatgc tgccactcgt 1380

gtcacccttg ccgatccgct gcatctgatc ctcggcgcac gttataccaa ctggcgggtt 1440

gatacgctga cttacagcat ggagaaaaac cacaccacgc cttacgctgg tctggtgttt 1500

gacatcaatg acaactggtc gacctacgcc agctatacct ctattttcca gccgcaaaat 1560

gatcgtgaca gttcaggcaa atatctggct ccaatcaccg gtaacaacta cgagctgggt 1620

ctgaaatcgg actggatgaa tagccgtctg accaccacgt tagccatctt ccgtattgag 1680

caggataatg tcgctcagtc caccggtaca cctatccccg gcagcaacgg cgaaaccgcc 1740

tataaagcgg tggatgggac agtcagtaaa ggggtggaat ttgaactcaa cggcgcaatt 1800

accgacaact ggcagctgac atttggcgca acgcgctata ttgcagagga taacgaagga 1860

aacgccgtta atcctaatct gccacgcacc acggttaaaa tgttcaccag ctatcggttg 1920

cctgtcatgc cagagttgac agtcggcggt ggtgttaact ggcaaaatcg cgtgtatacc 1980

gacaccgtga caccgtatgg caccttccgc gccgagcaag gtagctacgc gttggtggat 2040

ctcttcaccc gctaccaggt gacgaaaaac ttctcgttac aggggaacgt caataacctg 2100

ttcgacaaaa cctacgatac caacgtggaa ggttctatcg tctacggcgc accgcgtaat 2160

ttcagcatta ccggcacgta tcaattctga 2190
<212> Type : DNA
<211> Length : 2190

SequenceName : SEQ ID 456
SequenceDescription :



Sequence 



<213> OrganismName : Escherichia coli 0157:H7
<400> PreSequenceString :

atggcgcgat tccagtttaa aaaccgtaaa aataatggac ttattttttt tataagtttt 60

atggtgatgg gagaagctgc aattgctgca ccgctgccac aatgggcgaa tgccccagcg 120

gtaacgccag ttgcccaatt atctttacag gaaagtatat tacgcgcctt tgcgcgaaat 180

cccggtgtca ctcaacaggc ggcgcagata ggtattggcg aagcgcaaat tgatgaagcc 240

aaaagtgcct ggtatccgca cgttggatta accggcaacg cggggccgtc ccgacaaacg 300

gactccagcg gcaggcttga taacaacgtt tcgtatggca taaccctgac acaactggtg 360

tatgactttg gtaaaactaa caacgatatc aatctgcaaa ctgccgcccg tgacagctac 420

cgctttaaat tgatggcaac cttaaccgat gttgcagaga aaacggcgac fcgcctacatg 480

gaagtcagtc gttatcaggc tttgtgcgat gcggctcaac gcaatattca ctcgctggaa 540

aacgtctaca acatggcggc attgcgcgct aacgcaggcc tgaactcatc gtcggatgaa 600

ttacaggccc agacgcgtat tgccggaatg cgctcaacgc tggagcaata tcaggcgcag 660

atggcgagcg ccaaagcgca actggcggtg ctcactggcg tacagccgga ggcgatagcc 720

gcgccacctg ctgaattagc cgagcagccg gtatcgctga agaatattga ttaccagtcc 780

atcccgctgg tgctggcggc agaaaactta cgccagtcag cacagtacgg cgtggaaaaa 840

acgaaagcgc aatactggcc aacgctcagt attcaggggg gtaaaacgcg ctaccagacc 900

agcgaccgct cgtattggga tgatcagcta caactgaacg ttaacgcgcc gctgtatcag 960

ggcggcgcgg tttctgccca ggtgcaacag gccgaagggc aacaaaaaat ctctgcctcg 1020

caggtcgaac aggcaaaact ggatgtgctg caacgagcgt ctgtggcata tgcgaactgg 1080

accggcgcac ggggtcgtga agaagccggt ttagcgcaat ccgaaagtgc gcacaaaacg 1140

cgagatgtgt accaaaatga atataaattg ggtaaacgca gtctgaatga tctattaacc 1200

gtcgaacaag atgtctttca ggcgcaatcg gctgaaataa atgccaatta tgatggctgg 1260

gtcgccgccg taaattatgc cgctgcggtg aataacctta ttccattagc gggaattaaa 1320

cagggcttat acaacgactt acccgacttg aaataa 1356
<212> Type : DNA
<211> Length : 1356

SequenceName : SEQ ID 457

SequenceDescription :



Sequence 

<213> OrganismName : Escherichia coli 0157:H7
<400> PreSequenceString :

atggctaagt tcacaccttc attctcagga atcaaaggtc gggcgctctt ttcactgctc 60

tttgcagcac caatgattca tgcaaccgac actgcaacga ccaaagatgg cgaaacaatc 120

actgttacag cggatgcaaa taccgcaact gaggccjaccg atggttatca acctctgagc 180

acttccacgg cgacattaac cgatatgccg atgctggata tcccgcaggt ggtcaatacg 240 

gttagcgatc aggttctgga aaatcagaat gcaaccfacgc tggatgaagc gctttataac 3 00 

gtcagtaacg tggtacagac caatacatta ggcggaactc aggacgcctt tgtacgtcgt 3 60 

5 ggttttggtg ctaaccggga tggctccatc atgaccaacg gcctgcgaac tgtacttcct 42 0 

cgcagtttca acgccgccac agaacgtgtg gaagtcjctaa aaggtccggc ctccacgctg 4 80 

tatggcattc tcgatcctgg tggattgatt aacgtcgtga ccaagcgccc ggaaaaaaca 540 

ttccatggtt ctgtctcagc cacgtcctcc agttttggcg gaggcactgg gcaacttgat 60 0 

atcacaggtc ccattgaagg cactcagctg gcataccgcc tgacggggga agtgcaggat 660 

10 gaagattact ggcgaaactt cggtaaagag cgcagtacat ttattgcccc gtcactcacc 72 0 

tggtttggtg ataatgcaac agtaaccatg ctctafctccc atcgggacta taaaactccg 780 

ttcgatcgtg gaacgatttt cgaccttacg acgaaacagc ccgtaaacgt tgatcgaaaa 840 

atacgttttg acgaaccgtt taatattaca gatggtcagt ccgatctggc gcaactcaac 90 0 

gcagaatatc atctcaatag ccagtggaca gcgqgctttg attacagcta cagccaggat 960

15 aaatacagcg acaatcaggc tcgcgttacc gcgtatgatg caacgacagg aacgctgaca 102 0 

cggcgtgttg atgcaactca gggatctacc cagcgtatgc attctactcg tgcggatctg 10 80 

caagggaatg ttgatattgc tgggttctat aatgacjattc tgggtggggt gtcatatgaa 1140 

tattatgatc ttctgcgcac agatatgatt cgctgtaaaa acgctaaaga tttcaatatc 12 0 0 

tacaaccccg tttatggcaa taccagcaaa tgcacaacgg tttcggcgtc ggacagtgat 1260 

20 cagacgatca aacaggagag ctactcagct tatgcscagg atgcgctcta tctgaccgat 132 0 

aactggattg ccgtcgccgg gatccgctat cagtattaca cgcagtatgc gggtaaaggc 13 80 

cgtcctttta atgtcaatac tgacagccgc gatgaacaat ggacgcccaa actggggtta 1440 

gtctacaaac tgacgccatc ggtatcctta tttgctaatt attcgcaaac atttatgccg 1500 

caatcgtcaa ttgccagcta catcggagat cttccaccgg aatcatctaa tgcttacgaa 1560 

25 gtcggggcaa aattcgagct gttcgatggt ataaccgcag atattgcgct gtttgatatc 1620 

cataaacgta acgtgttgta taccgaaagt attggtgatg aaaccatcgc caaaacggca 1680 

ggccgcgttc gttcaagagg ggtagaagtc gaccttgcgg gagcattaac tgaaaacatt 1740 

aatatcattg ccagctacgg ctataccgat gctaacjgttc tggaagatcc tgattatgca 1800 

gggaaaccat tgccgaatgt tcctcgtcat accggfctcgc tattcctgac ctatgacatt 18 60 

30 cataacatgc caggcaataa cacactgacg tttggcggtg gtggacattg cgtaagccgt 1920 

cgttcggcaa ccaatggggc tgactattat ctgccaggct atttcgttgc cgatgccttc 198 0 

gccgcataca aaatgaaatt gcagtatccg gtcacactgc aattaaacgt caaaaacctg 2040 

tttgataaaa cgtattacac ctcttccatc gccacraaata atctggggaa tcagattggc 210 0 

gatccgcgtg aagtgcaatt cacggtgaaa atggaatttt ga 2142 

<212> Type : DNA

<211> Length : 2142 

SequenceName : SEQ ID 458 
SequenceDescription : 



Sequence



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

atgcggactc tgcagggctg gttgttgccg gtgtttatgt tgcctatggc agtatatgca 60

caagaggcaa cggtgaaaga ggtgcatgac gcgccagcgg tgcgtggcag tattatcgcc 120

aatatgctgc aggagcatga caatccgttc acgctctatc cttatgacac caactacctc 180

atctacaccc aaaccagcga tctgaacaaa gaagcgattg ccagttacga ctgggcggaa 240

aatgcgcgta aggatgaagt aaagtttcag ttgagcctgg catttccgct gtggcgtggg 300

attttaggcc cgaactcggt gttgggtgcg tcttatacgc aaaaatcctg gtggcaactg 360

tccaatagcg aagagtcttc accgtttcgt gaaaccaact acgaaccgca attgttcctc 420

ggttttgcca ccgattaccg ttttgcaggt tggaccjctgc gcgatgtgga gatggggtat 480

aaccacgact ctaacgggcg ttcagacccg acctcccgca gctggaaccg cctttatact 540

cgcctgatgg cagaaaacgg taactggctg gtagaagtga agccgtggta tgtggtgggt 600

aatactgacg ataacccgga tatcaccaaa tatatcjggtt actaccagct taaaatcggc 660

tatcacctcg gtgatgcggt gctcagtgcg aaaggacagt acaactggaa caccggctac 720

ggcggcgcgg agttaggctt aagttacccg atcaccaaac atgtgcgcct ttatactcag 780

gtttacagcg gctatggcga atcgctcatc gactataact tcaaccagac ccgtgtcggt 840

gtgggggtta tgctaaacga tttgttttga 870
<212> Type : DNA 

<211> Length : 870

SequenceName : SEQ ID 459 
SequenceDescription : 



Sequence 


<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString :
atggcggttc aaaagaatgt tatcaaaggc atactggcag gtacgtttcjc gctaatgctg 60

agcggttgtg tcactgtgcc ggacgccatt aaaggcagca gtcccacgcc gcaacaagat 120

ttagttcggg tgatgagtgc gccgcagctg tacgttggtc aggaggcacg ctttggtggc 180

aaagtggttg cggtacaaaa ccagcaaggg aaaacccgcc tggaaattgc taccgtaccg 240

ctggacagcg gagccagacc gacgctggga gaaccttctc gcggtcgcat ttatgccgat 300

gtgaacggtt ttctggaccc ggtggatttc cgtggacaac tggttacggfc agtcgggcca 360

atcactggtg cggttgacgg aaaaatcggc aacacgccct ataaatttett ggtgatgcag 420

gcaacaggtt acaaacgttg gcatttaacc cagcaggtga ttatgccgcc tcagccgatt 480

gatccctggt tttatggcgg tcgtggctgg ccctatggcc acggcggatg gggctggtat 540

aatcccggcc ccgcgagagt acaaacagtt gtaactgaat aa 582
<212> Type : DNA 
<211> Length : 582 

SequenceName : SEQ ID 460 
SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli 0157:H7
<400> PreSequenceString :

atgcgtaagc agtggctcgg gatctgcatc gcggcaggaa tgctcgcggjc atgtacaagc 60

gatgatggtc agcaacagac agtaagtgta ccgcagcctg cggtatgtaa cggccccata 120

gttgaaatta gcggggcgga cccgcgtttc gaaccactga acgcgacggc aaatcaggat 180

taccagcgcg acggtaaaag ctacaaaatc gtgcaggatc cgtctcgatt tatccaggcg 240

ggactggcgg caatctatga tgccgaacca ggcagtaacc tgacggcctc tggcgaagct 300

ttcgatccga cacagctgac ggcggcccat ccaacgcttc cgatccccag ctacgccaga 360

atcactaacc tggctaacgg gcgaatgatc gtggtgcgca ttaatgatcg cggtccttac 420

ggcaacgacc gcgttatttc gctttctcgc gcagcagctg accgtcttaa cacgtcaaac 480

aacaccaaag ttcgtatcga tccgattatt gtcgcccagg atggttcgct ttctggtcct 540

ggtatggctt gtaccacagt cgccaaacag acttacgccc tgcctgcacc tcccgattta 600

agcggtggcg cgggaacaag ttcagtgtct ggcccgcagg gtgacattct tccggtcagt 660

aattcgacgc taaaaagcga agatccgacc ggcgcgccgg taaccagcag cggtttcctc 720

ggcgcaccaa cgaccttagc gccaggtgta ctggaaggca gcgaaccgac gcctgctcca 780

cagcccgttg ttacagctcc gtcgacaacg cctgcaacct cgcctgcaat ggtgacaccg 840

caagccgcct cgcaaagcgc cagcggcaac tttatggtgc aggtcggggc cgtaagcgat 900

caggctcgtg cgcaacagta ccaacagcaa ctgggacaga agttcggccgt ccccggtcgc 960

gtaactcaaa atggcgcggt ctggcggatc cagcttggcc cattcgccaa caaagccgaa 1020

gccagtacct tgcagcaacg tttgcaaacc gaagcccaat tacagtcafet tattaccacc 1080

gcgcagtag 1089
<212> Type : DNA 

<211> Length : 1089

SequenceName : SEQ ID 461 
SequenceDescription : 



Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

atgattaaac gcgtattggt tgtttcaatg gtaggtctgt ctcttgtcgg ttgtgttaat 60
aacgacaccc tgtcagggga tgtttatacc gcttctgaag cgaaacaagt acagaatgtc 120
agctatggaa ccatcgttaa cgtacgtccg gtacagattc agggcggtcfa tgattccaac 180
gttatcggtg caattggcgg tgctgttctt ggtggtttcc tgggaaatac tgttggtggc 240
ggaaccgggc gttctctggc tactgcagca ggcgctgttg caggtggcgt agctggtcag 300
ggcgtacaga gtgcaatgaa caaaacgcaa ggtgtcgagc tggaaattcg taaagacgat 360
ggtaatacca tcatggtggt acagaaacaa ggcaacactc gtttctctcc gggccaacgt 420
gtcgtactgg ccagcaatgg cagtcaggtg accgtttctc cgcgctaa 468
<212> Type : DNA 
<211> Length : 468 

SequenceName : SEQ ID 462 
SequenceDescription : 



Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 
gtgtcaaagg caaccgaaca aaacgacaag ctcaagcggg cgataattat ttcagcagtg 60
ctgcatgtca tcttatttgc ggcgctgatc tggagttcgt tcgatgagaa tatagaagct 120 
tcagctggag gcggcggtgg ttcgtccatc gacgctgtca tggttgattc aggtgcggta 180 
gttgagcagt acaaacgcat gcaaagccag gaatcaagcg cgaagcgttc tgatgagcag 240

cgcaagatga aggaacagca ggctgctgaa gaactgcgtg agaaacaagc ggctgaacag 300

gaacgcctga agcaacttga gaaagagcgg ttagctgctc aggaacagaa aaagcaggct 360

gaagaagccg caaaacaggc cgagttaaag cagaagcaag cggaagaggc ggcagcgaaa 420

gcggcggcag atgctaaagc gaaggccgaa gcggatgata aagctgcgga agaagcagcg 480

aagaaagcgg ctgcagacgc gaagaaaaaa gcagaagcag aagccgccaa agccgcagcc 540

gaagcgcaga aaaaagccga ggcagcagct gcggcgctga agaagaaagc ggaagcggca 600

gaagcagctg cagctgaagc aagaaagaaa gcggcagcag agaaagctgc agccgacaaa 660

aaagcagcag aaaaagcggc tgctgaaaag gcagcagctg ataagaaagc agcggcagaa 720

aaagccgccg cagacaaaaa agcggcagct gcaaaagcag cagctgaaaa agccgctgca 780

gcaaaagctg ccgcggaggc agatgatatt ttcggtgagc taagctctgg taagaatgca 840

ccgaaaacgg ggggaggggc gaaagggaac aatgcttcgc ctgccgggag tggtaatact 900

aaaaacaatg gcgcatcagg ggccgatatc aataactatg ccgggcagat taaatctgct 960

atcgaaagta agttctatga cgcatcgtcc tatgcaggca aaacctgtac gcfcgcgcata 1020

aaactggcac ccgatggtat gttactggat atcaaacctg aaggtggcga tcccgcactt 1080

tgtcaggctg cgttggcagc agctaaactt gcgaagatcc cgaaaccacc aagccaggca 1140

gtatatgaag tgttcaaaaa cgcgccattg gacttcaaac cgtaa 1185
<212> Type : DNA 
<211> Length : 1185 

SequenceName : SEQ ID 463

SequenceDescription : 



Sequence 



<213> OrganismName : Escherichia coli 0157:H7
<400> PreSequenceString :

gtgatgaaat ttaaaaaatg tcttctgcct gtggcaatgt tagcgtcatt cactctggca 60 

ggatgccagt caaatgctga cgatcatgcc gccgatgttt atcaaaccga tcaactgaat 120 

accaaacaag aaactaaaac cgttaatatt atttccattc ttcccgcaaa agttgccgta 180 

gacaactccc aaaataaacg gaacgcacaa gccttcggcg cgcttattgg cgcagtcgct 240

ggcggtgtta tcggccacaa cgtcgggtct ggcagcaatt ccggaacgac ggcaggtgca 300

gttggcggcg gagctgtagg cgcggcagcg ggttctatgg tgaatgataa aaccttagtg 360

gaaggtgttt ctfctaacgta taaggaaggc accaaagtgt atacctctac tcaggtgggt 420

aaagagtgcc agtttacgac aggtttagcc gttgttatta ccacgacgta taacgaaacg 480

cgtattcagc caaataccaa atgtcctgaa aagagctaa 519



<212> Type : DNA 

<211> Length : 519 

SequenceName : SEQ ID 464 
SequenceDescription : 


Sequence 



<213> OrganismName : Escherichia coli 0157:H7
<400> PreSequenceString : 

atgttgctta gcataatcac tgtcgcgttt cgtaacctcg aagggatagt caaaacacat 60

gcctcgctgg cgcatctggc gcaggcggaa gatatcagct tcgaatggat tgttgtcgat 120

ggcggttcca atgacggcac tcgtgagtac ctggaaaatc tcaatggtat ctataaccta 180

cgctttgtca gcgagccaga taacggtatc tacgacgcca tgaacaaggg cattgcgatg 240

gcacaaggca agttcgcgtt gtttctcaac tcgggcgata tttttcatca ggatgccgca 300

tattttgtcc gtaagttaaa aatgcaaaaa gataacgtga tgatcaccgg cgatgcgctg 360

ctggattttg gcgacgggca taaaattaaa cgtagcgcca aaccgggctg gtaitatttat 420

cacagcctgc ccgccagtca tcaggcgata tttttcccgg tatccggttt gaaaaaatgg 480

cgttatgacc tggaatataa agtttcttct gattacgcgc tggcagccaa aatgtataaa 540

gcaggttatg catttaaaaa actcaatggc ctggtgtctg aattttctat gggtggggta 600

tctaccacca ataatatgga attgtgtgct gacgcgaaaa aagtccaacg gcaaatatta 660

catgtgcctg gcttttgggc tgaattatcc tggcatttac gccagcgtac tacctcaaag 720

acgaaagcct tatataacaa aagctga 747
<212> Type : DNA 
<211> Length : 747 

SequenceName : SEQ ID 465

SequenceDescription : 



Sequence



<213> OrganismName : Haemophilus influenzae Rd
<400> PreSequenceString :

atgaaattaa caacactgca aaccttgaaa aaagggttta cattaatcga gctaatgatt 60 
gtgattgcaa ttattgctat tttagccacc atcgcaattc cttcttafcca aaattatacc 120

aaaaaagctg cggtatccga attactgcaa gcgtctgcgc cttataaggc tgatgtggaa 180

ttatgtgtat atagcacaaa tgaaacaaca agctgtacag ggggaaaaaa tggtattgca 240

gcggatataa agacagcaaa aggctatgta gcctcagtta tcactcaatc aggtggtatt 300

acagtaaaag ggaatggcac attggcaaat atggaatata ttttgcaagc taaaggtaat 360

gctgcagcag gtgtaacttg gacaacaacc tgcaaaggaa cggatgcctc tttatttcca 420

gcaaattttt gcggaagtgt cacaaaatga 450
<212> Type : DNA 
<211> Length : 450 
SequenceName : SEQ ID 466

SequenceDescription : 



Sequence 

<213> OrganismName : Haemophilus influenzae Rd
<400> PreSequenceString : 

atgctaaata aaaaattcaa actcaatttt attgcgctta ctgtcgccta cgcattaacc 60 

ccttatacag aagctgcgtt agtgagagac gatgtggatt atcaaatatt tcgtgatttt 120 

gcagaaaata aagggagatt ttctgttggt gcaacaaatg tggaagtcyag agataaaaat 180 

20 aaccactctt taggcaatgt tttacctaat ggcattccga tgattgattt tagtgttgtg 240 

gatgtagata aacgcatcgc cacattgata aatccacaat atgtagtagg tgtaaaacac 3 00 

gttagtaacg gcgtgagtga actacatttt gggaacttaa atggcaatat gaataatggc 3 60 

aatgctaaat cgcaccgaga tgtatcttca gaagaaaata gatatttttc cgttgagaaa 42 0 

aatgagtatc caactaaatt gaatggaaaa gcagtaacta ctgaagatca aactcaaaaa 48 0 

25 cgccgtgaag actactatat gccacgtctt gataaatttg ttaccgaagt tgcaccaata 540 

gaggcttcaa ctgcaagtag tgatgctggc acatataatg atcagaataa atatcctgct 600 

tttgtaagac taggaagtgg tagtcaattt atttataaaa aaggagataa ttacagctta 660 

attttaaata atcatgaggt tggaggcaat aatcttaaat tggtgggcga tgcctatacc 720 

tatggtattg caggcacacc ttataaagta aaccacgaaa ataatggact aattggtttt 78 0 

30 ggcaattcaa aagaggaaca cagcgatcca aaaggaatat tatctcaaga tccgcttacc 840 

aattatgctg ttttaggcga cagtggctcc ccattatttg tatatgatag agaaaaagga 900 

aaatggcttt ttcttgggtc ttatgatttt tgggcaggtt ataacaaaaa atcttggcaa 960 

gaatggaata tttataaacc tgaatttgca aaaactgttc tagataaaga tactgcaggt 1020 

tctttaactg gttctaacac ccaatacaat tggaatccta ctggcaaaac aagcgfctatt 10 80 

35 tctaatggtt ctgaatctct aaatgttgat ttattcgata gtagtcagga tactgactct 1140 

aagaagaaca atcacggaaa aagtgtgact cttagaggaa gtggaacgct taccttaaat 120 0 

aataatatcg atcaaggcgc aggcggcttg ttctttgaag gagattafcga agttaaaggc 1260 

acttctgata gtaccacttg gaaaggagct ggcgtttctg ttgctgatgg aaaaacagta 1320 

acgtggaaag tacataaccc gaaatctgat cgtttagcta aaatcggcaa aggaacatta 13 80 

40 attgtagaag gaaagggaga aaataaaggt tcgctaaaag tgggcgatgg tactgttatc 1440 

ttaaaacaac aagctgatgc caataataaa gttaaagcct tttcacaagt aggtatagta 15 00 

agtggtcgct caactgttgt acttaatgat gataagcaag tagatccaaa ttccatttac 1560 

tttggcttta gaggtggtcg attagatgcc aatggcaata atctcacttt tgaacatatc 162 0 

cgtaatattg atgatggcgc aagactagta aatcacaata ccagcaaaac ctctactgta 168 0 

45 acaattactg gggaaagtct aattacagat ccaaatacaa ttactccata taatatagac 1740 

gcaccagatg aagataatcc ttatgccttt cgacggatta aagatggagg acagctctat 18 00 

ttaaatttgg aaaattacac ttattatgcg ttaagaaaag gtgcgagcac tcgttcagaa 1860 

ttacctaaaa atagtggcga aagcaatgaa aattggctat atatgggtaa aacttccgat 1920 

gaagccaaaa gaaatgtaat gaaccatatc aacaacgagc gtatgaatgg ctttaacggt 1980 

50 tattttggcg aggaagaggg taaaaataac ggtaatctaa atgtgacttt taaaggcaaa 2 040 

agtgagcaaa atcgcttttt attaacaggc ggaacaaacc ttaatggcga tttaaaggtt 210 0 

gaaaaaggca cattattcct ttctggcaga ccaacaccgc acgcaagaga tattgcaggt 2160 

atttcttcga caaaaaaaga tcaacacttt gctgaaaata atgaagtcjgt agtagaagat 222 0 

gactggatta accgcaattt taaagcaaca aatattaatg taaccaataa cgcaaccctt 22 8 0 

55 tattcaggtc gcaatgttgc aaacattact tcaaatatca cagcttctga taatgcaaaa 2340 

gtacatattg gctataaagc aggcgatacc gtttgtgtac gttctgacta tacgggctat 2400 

gtgacttgca ctactgacaa gttatccgat aaagccctta atagctttaa cgccaccaat 2460 

gtatctggca atgtaaattt atcaggtaat gcaaactttg tcttaggcaa agctaactta 252 0 

ttcggcacaa ttagcggcac gggaaatagc caagtacgtt taaccgaaaa tagccattgg 2580 

60 catttaacag gcgatagcaa tgttaatcag ttaaatttag acaaggggca tattcattta 2640 

aatgcacaaa acgatgcaaa taaagtaact acatataaca cgctgactgt gaatagctta 270 0 

tcaggtaacg gttctttcta ttatttaact gatctttcca ataaacaagg cgacaaagtt 2760 

gttgtaacta aatccgccac aggtaacttt acattacaag tggcagataa aacaggcgag 282 0 

cctacaaaaa atgaactcac gctttttgat gcgtcaaatg ctacaagaaa taatttgaat 2880 

65 gtgtcattag ttgggaatac cgttgattta ggtgcttgga aatataaatt acgtaatgtt 2940 

aatggacgtt acgatttgta taacccagag gtggaaaaaa gaaatcaaac tgtcgatacg 30 0 0 

acaaatatca caacacctaa taatattcaa gctgatgtgc ctagcgtacc aagtaacaat 3060 
gaagaaatag cccgtgttga aacaccagtt ccaccacctg cgcctgctac accatcagag 3120

acaactgaaa cagtggctga aaatagtaag caagaaagta aaacagtaga gaaaaacgag 3180 

caagacgcaa ccgagacaac agctcaaaat ggagaagttg cagaagaagc taaaccaagt 3240 

gtaaaagcta atactcaaac aaatgaagtg gctcaaagtg gaagtgaaac cgaggaaact 33 00 

caaacgactg aaataaaaga aacagctaaa gtagaaaaag aggaaaaggc taaagtagaa 3360

aaagatgaaa ttcaagaagc acctcaaatg gcttctgaaa cgtctccgaa acaagcaaag 3420 

cctgctccta aagaagtttc aactgatacg aaagtagaag aaactcaagt tcaagctcaa 3480 

ccgcaaacac aatcgacaac tgttgctgcg gcagaggcaa cttcgccaaa cagtaaacca 3540 

gcggaagaaa ctcaaccaag tgaaaaaact aacgctgaac ctgtaacgcc tgtagtatca 3 600 

10 aaaaatcaaa cagaaaatac gaccgaccaa ccaacagaaa gagagaaaac ggctaaagta 3 660 

gaaacagaga aaactcaaga accccctcaa gtggcttctc aagcgtctcc gaaacaggaa 3720 

cagtctgaaa ctgttcaacc gcaagcagtg cttgaaagtg aaaatgttcc gactgttaat 3780 

aatgcagaag aagttcaagc tcaactgcaa acacaaacaa gtgcaacagt aagcactaaa 3 840 

caacctgcac cagagaattc aataaatact ggatctgcaa ccgcaataac agaaactgct 3900

15 gaaaaatccg ataaaccaca aacggaaact gcggcttcga ctgaagatgc tagtcagcat 3 960 

aaagcgaata ctgttgcgga taattctgta gcaaataatt cagaaagcag tgatccaaag 4020 

agtagacgta gaagaagtat tagccagcct caagagactt ctgctgaaga aacaacagca 40 80 

gcttctactg acgaaacaac aatagctgat aattcaaaac gcagtaagcc aaatcgtaga 4140 

agtagaagaa gtgttcgctc ggaaccaact gttacaaatg gcagcgatcg ttctacagta 4200 

20 gcattgcgcg atctcacaag tacaaacaca aatgcggtaa tttctgatgc aatggcaaaa 4260 

gcacaatttg ttgcattaaa tgtggggaaa gcagtttctc aacatattag ccagttagaa 432 0 

atgaataacg aggggcaata taacgtttgg gtatctaata cttcaatgaa cgaaaattat 4380 

tcctcaagtc aatatcgtcg ttttagttct aaaagtacgc aaactcaact tggttgggat 4440 

caaacaatct caaacaatgt tcagttaggt ggcgtgttta cttatgttcg caatagtaac 4500 

25 aactttgata aggcaagcag taaaaatact ctagcacaag ttaatttcta ttctaaatat 4560 

tatgcggata atcattggta tttgggcatt gatttaggct acggcaagtt ccaaagcaac 4620 

ctaaaaacca atcataatgc gaaatttgct cgccatactg cacaafcttgg tttaaccgca 4680 

ggcaaagcat ttaatcttgg caattttggt attacgccaa tagtaggcgt gcgttatagc 4740 

tatttatcaa acgctaattt tgcattagct aaagatcgca ttaaagtaaa tccaatatct 48 00 

30 gtcaaaacag cctttgctca agttgattta agttatactfc atcacttagg cgagttttcc 4860 

gttacgccaa ttttgtctgc tcgatatgat acaaatcaag gcagcggaaa aattaatgta 4920 

aatcaatatg attttgctta caacgtggaa aaccaacagc aatataacgc agggcttaaa 4980 

ttgaaatatc ataatgtgaa attaagtcta ataggcggat taacaaaagc gaaacaagcg 5040 

gaaaaacaaa aaactgcaga attaaaacta agttttagtt tttaa 5085 

<212> Type : DNA

<211> Length : 5085 

SequenceName : SEQ ID 467 
SequenceDescription : 

Sequence



<213> OrganismName : Haemophilus influenzae Rd 
<400> PreSequenceString : 

atggcattag taaacaaaat taaaacatta tcatcagtag gtattctagc ggctacatta 60 

tttcttgcag gctgccaagc acaatcaaat atattagcat ttacaccgcc tgcaccaagt 120

gcttcaatga atgttaatcg aactgccgtt gtatctgtga caacaaaaga tagccgtgca 180 

atacaagaga ttgcgagtta tacgaaacac ggggaactga ttaaattaaa tgcatcccca 240 

agtgttacac aattatttca gcaagtgatg cagcaaaatt taattagtaa aggttttaga 300 

gttgggcaat taaatggttc aaatgcgtgg gtaactgtgg atgtgcgtga atttggtacg 360

caagtagaac aaggtaatct tcgttataaa cttaatacca aaattcaagc gacagtttat 420

gtacaaggtg cgaaaggttc gtataataaa tcatttaatg tcacgcactc acaagagggc 480 

gtatttaatg cgggcaatga tgaaattcat aaagtgctat ctcaaacttt taatgatatt 540 

gtgaacaata tttatcaaga tcaagaagtt gcggctgcga taaaccaata ttctaattaa 600 

<212> Type : DNA

<211> Length : 600 

SequenceName : SEQ ID 468 
SequenceDescription : 

Sequence



<213> OrganismName : Haemophilus influenzae Rd 
<400> PreSequenceString : 

atgctgtgct ggattggcta caaaaatggg attttaccgc agcagaacag cacgctatat 60 

65 ccttggctga atccgtcaaa gtgcggcgta atttttgatg gttttcaact tgtgggcgat 12 0 

gatttcaatt cagatcaaac ggctgaaaat acatcgccag cttggcaagt gctttacaca 18 0 

acccatttac aaagttgctc gccgattcat agtggagaaa atttcgcacc catccctttg 240 
tataaacaac taaaaaatca accgcaccta agccaagatc tgattaaatg gcaagaaaat 300

tggcaagcct gcgatcaact acaaatgaat ggtgcggtat tagaacaaca atctttggca 360

gaaatttccg atcatcaaag tacgctttca aaacacggac gatatttagc tcaagaaata 420

gaaaaagaaa ctggcatacc gacttactat tatttatatc gtgtaggtgg gcaatcttta 480

gaatctgaaa aatcccgttg ctgcccttct tgtggtgcaa attgggcgtt aaaagacgcg 540

atttttgata cctttcattt taaatgcgat acctgtcgat tagtttcgaa tctatcgtgg 600

aattttttgt aa 612
<212> Type : DNA
<211> Length : 612
SequenceName : SEQ ID 469

SequenceDescription :

Sequence

<213> OrganismName : Haemophilus influenzae Rd
<400> PreSequenceString :

ttgggtgcat ttgcctttgc ttctgttaca aatgcgaata tttatgctga gggcgatatc 60

ggtttatctc aaactaaagc aaacggtagt aacaatacaa gagttggacc tcgcgtatcc 120

gtgggttata aagtaggaaa tacacgtgtt gcgggtgatt atactcatca tggaaaagtt 180

gatggcacaa aaattcaagg tttaggtgca tcagtattat atgattttga cacgaattct 240

aaagtgcaac cttatgttgg tgctcgtgta gcgactaatc aatttaaata caccaatcgc 300

gcagaacaaa agtttaaaag ttcttctgat attaagctcg gatatggggt tgtagcaggt 360

gcaaaatata agttagatgg caactggtac gcaaatggtg gagttgagta caatcgttta 420

ggtaattttg atagtaccaa agttaataac tatggtgcaa aagttggtgt ggggtacgga 480

ttctaa 486
<212> Type : DNA
<211> Length : 486

SequenceName : SEQ ID 470
SequenceDescription :

Sequence

<213> OrganismName : Haemophilus influenzae Rd
<400> PreSequenceString :

atgaaaaaac ttctaatcgc aagtttatta ttcggtacga caacgactgt gtttgccgca 60

ccttttgtgg caaaagatat tcgtgtggat ggtgttcaag gtgacttaga acaacaaatc 120

cgagcaagtt tacctgttcg tgctggtcag cgtgtgactg acaatgatgt ggctaatatt 180

gtccgctctt tattcgtaag tggtcgattc gatgatgtga aagcgcatca agaaggcgat 240

gtgcttgfctg ttagcgttgt ggctaaatcg atcatttcag atgttaaaat caaaggtaac 300

tctattattc ccactgaagc actaaaacaa aacttagatg ctaacgggtt taaagttggc 360

gatgttttaa ttcgagaaaa attaaatgaa tttgccaaaa gtgtaaaaga gcactatgca 420

agtgtaggtc gctataacgc aaccgttgaa cctattgtca atacgctacc aaataatcgc 480

gctgaaattt taattcaaat caatgaagat gataaagcaa aattggcatc attaactttc 540

aaggggaacg aatctgttag tagcagtaca ttacaagaac aaatggaatt acaacctgat 600

tcttggtgga aattatgggg aaataaattt gaaggtgcgc aattcgagaa agatttgcag 660

tcaattcgtg attattattt aaataatggc tatgccaaag cacaaatcac taaaacggat 720

gttcagctaa atgatgaaaa aacaaaagtt aatgtaacca ttgatgtaaa tgaaggttta 780

cagtatgacc ttcgtagtgc acgcattata ggtaatctgg gaggtatgtc tgccgagctt 840

gaacctttac tgtcagcatt acatttaaat gatactttcc gccgtagtga cattgcagat 900

gtagaaaatg caattaaagc aaaacttgga gaacgcggtt acggtagcgc aacggtaaat 960

tcagtacctg attttgatga tgcaaataaa acattagcga taacccttgt tgttgatgct 1020

ggacgacgtt taactgttcg ccaacttcgc tttgaaggaa ataccgtttc tgctgatagc 1080

actttacgtc aggaaatgcg ccaacaagaa ggaacttggt ataattcaca attagttgag 1140

ttaggaaaaa ttcgcttaga tcgtacaggt ttcttcgaaa cagtcgaaaa ccgaattgat 1200

cctatcaatg gtagtaatga tgaagtggat gtcgtatata aagtcaaaga acgtaacacg 1260

ggtagtatca actttggtat tggttacggt acagagagtg gtattagtta tcaagcaagt 1320

gttaaacaag ataatttctt gggaacaggg gcggcagtaa gtatagctgg tacgaaaaat 1380

gattatggta cgagtgtcaa tttgggttat accgagccct attttactaa agatggtgta 1440

agtcttggtg gaaatgtttt ctttgaaaac tacgataact ctaaaagtga tacatcctct 1500

aactataagc gtacgactta tggaagtaat gttactttag gtttccctgt aaatgaaaat 1560

aactcctatt atgtaggatt aggccatacc tataataaaa ttagtaactt tgctctagaa 1620

tataaccgta atttatatat tcaatcaatg aaatttaaag gtaatggcat taaaacaaat 1680

gactttgatt tttcttttgg ttggaactat aacagcctta atagaggcta tttcccaact 1740

aaaggggtta aagcaagtct tggtggacga gttactattc caggttctga taacaaatac 1800

tacaaactaa gtgcagacgt acagggtttc tacccattag acagagatca cctctgggtt 1860

gtatctgcaa aagcatctgc aggatatgca aatggttttg gaaacaagcg tttaccgttc 1920

tatcaaactt atacagcggg tggcatcggt tcattacgtg gttttgctta tggtagtatt 1980



ggacctaacg caatttatgc agaacatggt aatggtaatg gtacttttaa gaagataagt 2040

tctgatgtga ttggtggtaa tgcaatcaca actgcgagtg cagaacttat tgtaccaaca 2100

ccgtttgtga gtgataaaag ccaaaataca gtccgaacct ccctatttgt tgatgcggca 2160

agtgtttgga atactaagtg gaaatcggat aagagtgggt tagataacaa tgtattaaaa 2220

agcttacccg attatggcaa atcaagccgt attcgcgcct ctacaggtgt cggattccaa 2280

tggcaatctc ctattgggcc attggtattc tcttatgcca aaccaattaa aaaatatgaa 2340

aatgatgatg tcgaacagtt ccaatttagt attggaggtt ctttctaa 2388
<212> Type : DNA
<211> Length : 2388

SequenceName : SEQ ID 471
SequenceDescription :

Sequence

<213> OrganismName : Haemophilus influenzae Rd
<400> PreSequenceString :

atgttgaaaa aaacatctct tatttttacc gcacttttaa tgactggctg tgtgcaaaat 60

gcgaatgtaa caacacctca agcgcaaaaa atgcaagtag aaaaagtgga taaagcctta 120

caaaaaggcg aagctgatcg atatttatgt caagatgata gagttgttcg tgttgtacac 180

gccacgcata aaaaatacaa aaaaaatttg cattatgtta ctgtcacttt tcaaggcgta 240

tcagaaaaac taaccttaat gatttctgaa cgtggtaaaa attacgccaa tattcgttgg 300

atgtggcaag agcgtgatga ttttagtacg ctaaaaacga atctcggcga aattttagca 360

acgcaatgtg tctcacaaac aagtgaacgc ttatctggac aataa 405
<212> Type : DNA
<211> Length : 405

SequenceName : SEQ ID 472
SequenceDescription :

Sequence

<213> OrganismName : Haemophilus influenzae Rd
<400> PreSequenceString :

ttgagaatta ttattatttt ttttatggga ttaaatatga ccaattttag attagaacgt 60

gcttgcctat tccgttatgc ttgggctaac ggcaggtgtt gcttatgcag ctcaaccaac 120

caaccaacca accaaccaac caaccaacca accaaccaac caaccaacca accaaccaac 180

caaccaacca accaaaatag taatgtttct gaacaactag aacaaataaa tgtatctggt 240

tctaccgaaa atagtgatac aaaaacacca ccaaaaattg ctgaaacggt aaaaacagct 300

aaaacgctgg aaagagaaca agcaaacaac attaaagaca tcgttaaata cgaaacgggc 360

gttactgttg ttgaagctgg acgttttggg caaagcggtt ttgccattcg tggtgtagat 420

gaaaatcgtg tagcgattaa tattgatgga ttacgtcaag ctgaaacctt atcttctcaa 480

ggctttaaag agctttttga aggttatggt aatttcaata acacgcgtaa tggtgcagaa 540

attgaaacct taaaagaagt aaatatcaca aaaggggctg attctattaa gaatggtagt 600

ggttccttag gtggttctgt aatttataaa acaaaagatg cgagagatta tctcataaac 660

aaggattact atgtaagtta caaaaaggga tacgctacgg aaaataatca atcattcgat 720

acccttactc ttgcaggacg ttataaaaag ttcgatgtgc tagtggttac aacaagcaga 780

aatgggcatg aacttgagaa ctatggttat aaaaattata acgataaaat tcaaggtaaa 840

aaaagagaaa aagcagatcc atataaaatt gaacaagata gtacattatt aaaattatct 900

ttcaatccta cagaaaatca tcgttttacc tttgccgctg atttatatga acatcgttct 960

cgtgggcaag atttatccta tacattaaaa tatcaacgta gtggtaatga aacccctgaa 1020

gttgattcta gacacaccaa tgataaaaca aagagacgta atatttcatt tagttatgaa 1080

aatttctctc aaactccatt ttgggatacg ctaaaactca cttattctga tcaacgtatt 1140

aaaactcgtg cacgcacaga tgagtattgt gatgctggtg taagacattg tgaaggcaca 1200

gacaatccta cgggactaaa agtaacaaat gggaaaataa cacgtcgaga tggttcagac 1260

cttcaatttg aggaaaaaaa caatacagct aagagtagtg ataaaaccta tgacttcaag 1320

aaatttattg atactgataa gagagtaata gacgataaac tagtcctaaa caacccctct 1380

gacacttggt atgattgttc aatatttaat tgtgaaaata acgcaaaaat aaaagttttt 1440

aaaggtaaca attattatgg ctatgatgga aaatggaaag aagttgacct tgaaataaaa 1500

gaattaaatg gcaaaaaatt cgctaaaata aaggataatg ataggaaaat aaaatctatt 1560

cttccctctt cacctggtta tttagaacgc ctctggcaag agagagattt ggacaccaac 1620

acccaacaat taaatttaga tttaaccaaa gacttcaaaa tttggcatat tgaacataat 1680

ctacaatatg gtggatcata taataccgcg atgaagcgca tggttaatcg tgctggcaac 1740

gatgcttctg atgtgcaatg gtgggcaaca cctacgcttg gtgaggattc ttggactgga 1800

aaacctcaca cttgtgcaac gacttatgag tggaatgcta acctttgtcc tcgagttgat 1860

cctgaatttt cttacttatt acccattaaa acaacaggaa aatcagtcta tctctttgat 1920

aattttgtta taactgatta tttatctttt gatttgggtt atcgttatga caatatccat 1980

tatcaaccaa aatataaaca cggtatcaca cccaaattac ccgatgatat tgtgaaagga 2040

ttatttattc cattaccaaa caattcaaat tcagatccta ataaagttaa ggaaaatgta 2100
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caacaaaata ttgactatat cgccaaacag aacaaaaaat ataaagcaca ttcttatagt 2160 

tttgtttcaa ccattgatcc aacgagtttt cttcgtttac aactaaaata ttctaaaggt 2220

tttagaacac caacttcaga tgaaatgtat ttcaccttta aacaccctga tttcactatt 2280 

ttgccaaata ctgatttaaa accagaaata gcaaaaacca aagaaattgc tttcacatta 2340 

cataatgatg attggggatt tatctcgaca agtctgttta aaactaacta taaaaacttt 2400 

attgacctaa tatttaaaaa gcaagaaact tttaaagtag gcggctctgg aagaggtgaa 2460 

acattaccat tttctcttta tcaaaatata aatagagata atgcgtcttt aaaaggtatt 2520 

gaaattaatt caaaagtatt ccttggtaaa atggcaaaat ttatggatgg atttaaccta 2580

agctataaat atacctatca aaaaggcaga atgaatggca atattcctat gaatgcaatt 2640 

cagcctagaa ctatggtata tggtttagga tatgatcatc caaatcataa atttggtttc 2700 

gatttctata cgacacatgt agcaagtaaa aatccagaag atacttataa tatgttctat 2760 

aaagaagaaa ataaaaaaga cagcacaatt aaatggagaa gcaaatctta tactattcta 2820

gatttaattg gatatgtaca accaattaaa aatttaacca taagagctgg tgtatataat 2880

cttacaaacc gtaaatacat tacttgggat tctgctcgtt caattcgttc atttggaaca 2940

agtaatgtta tagatcaatc aacaggatta ggcattaacc gcttctacgc accaggtaga 3000

aattataaaa tgtcagttca gtttgaattt taa 3033 
<212> Type : DNA 
<211> Length : 3033 

SequenceName : SEQ ID 473 

SequenceDescription : 



Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString :

atgacttata gaaatggcaa aatagattta aaggaacgct ttagtaaaaa ccgctctttt 60 

aagggcatta aaaagaaaat cgctaaaaaa tatacaatca aaaactcgct ttctataatt 120 

tattccttaa aaacgcattc aaattcttct ctatccatta ataaaaaaat cttcttaggg 180 

ctagggttcg tttcggcttt gagcgctcaa agtgaagatt ataatagttc ggtgtattgg 240 

ctcaatagcg tgaatgaaaa caataataac aaatcctact atattagccc cttacgcact 300

tgggctgggg ggaataggag ttttacgcaa aattataaca atagtcaatt atacataggg 360

acaaaaaacg cttccgcaac gcccaatcat tcttctgtgt ggtttggaga aaagggctat 420 

atcggtttta ttacaggggt ttttaaggct agagacattt ttatcacagg agctgttgga 480

tcgggtaatg agtfcaaaaac cggtgggggg gcgatactgg tttttgaaag ctcaaacgaa 540 

ctaaccacta acggggctta ttttcaaaat aacagagccg ggacacaaac ttcttggatc 600

aatttgattt ccaataacag cgtgaatttg acaaacacgg attttggcaa tcaaacccct 660

aatgggggct ttaatgttat ggggcgaaag attacttata atggtgggag cgtcaatggt 720
gggaattttg gctttgataa cgtggatagc aatggcgcaa ccaccattag cggggtaact 780

ttcaacaata acggtgcgct cacttataag ggtgggaatg gtattggagg gagcatcact 840 

ttcactaact ctaatatcaa tcattacaag ctcaatctta acgctaatag cgttaccttt 900 

aataacagca ctctagggag tatgcctaat ggcaacgcta acactatagg gaatgcctac 960 

attcttaatg caaataatat tacttttaat aatttgacct ttaatggggg ttggttcgtt 1020 

tttaatagat ctgatgctca tgttaatttt caaggcacaa ctacgatcaa taaccccact 1080

tcaccctttg tcaatatgac cggtaaagtt accattaatc ctaatgcgat ttttaatatt 1140 

caaaattaca cgcccacgat agggaacgct tacacgctct ttagcatgaa aaatggcaat 1200 

atcgcttatg atgatgtgaa taatttatgg aatattatca ggcttaaaaa cacgcaagcc 1260 

acaaaagaca atagcaaaaa cgccacttcc aataacaaca cccacactta ctatgtaact 1320 

tacaatttag gcggcacgct ctatcatttc aggcaaattt ttagccctga ttccattgtt 1380 

ttacaatccg tctattatgg cgcgaataat ctttactaca ccaatagcgt gaatatccat 1440 

gacaatgtct ttaatttaaa aaatattaac gatgataggg ctgatacgat tttttatctt 1500 

aatggcttga acacttggaa ttacacgcaa gcgagattcg ctcaaaccta tggcgggaaa 1560 

aacagcgctt tagtctttaa cgccacgact ccttgggcta atggtgcgat ccctaaatct 1620 

aacagcacgg tgcgttttgg ggggtatgag ggagtcaatt gggggaaaac gggctatatc 1680 

accggcactt tcacagccga tagggtttat atcaccggta acatgatgtc tggcaatggc 1740 

gctcaaaccg gtgggggggc gactttgaat tttgtgggcg cgactgaaat taatatcgct 1800 

ggagccactt ttaaaaactt aaaaaccact tcacaaaact cttacatgac ttttatggcg 1860 

ttagggaatg gctctgggag tggtaagatc aatgtttctc agtctgattt ttacgattgg 1920 

acggatggag ggtatgattt taccggtaat ggcgtttttg acagcgtgaa tttcaacaag 1980 

gcttattaca aatttcaagg cgctgaaaat tcttacaatt ttaaaaacac gaatttttta 2040

gcagggaatt tcaaattcca gggcaagacc accattgaaa aatccgtttt aaacgacgct 2100 

tcttacgctt ttgatggcgt gaataacgcc tttaatgaag acaaatttaa tggcggatcg 2160 

tttaatttca accacgcaga gcaaacaaac gcttttaata acaactcgtt tagtggcgga 2220 

tcgtttagtt ttaacgccaa gcaagtggat tttaatggga attcgtttaa tgggggggtg 2280 

tttaatttca ataatacccc taaagccagt tttactaacg acacttttaa tgtgaataac 2340 

caattcaaaa taaatggcgc tcaaacggat tttactttca gtaagggcgt tgttttcaac 2400 

atgcaagggc ttttgagcag tttgagcgta ggcacgactt atcaattgct taacgctaaa 2460 

agcgtgggtt ataaggataa caataacgct ttgtatcaaa tgttgcgctg gactagcgga 2520



WO 2005/076010 



159/341 



PCT/IN2005/000037 



gaaaatccta gcggtaaatt agtagatgaa aataaaaccg cgccaaacag cgctaaaatt 2580

tataatgttc aattcactga taacggcttg acttactaca ttaaagaaaa ttttaataat 2640

gggatcacgc tcactcgttt atgcactcta ggctacacgc attgcgtgaa tattgataac 2700

gatgcgttta atcttaaaaa tgtcaataat aacgctagta acaccgtgtt ctatctcaac 2760

ggcatgacga cttggaagac tgctggcaca ggagttttca cgcaagatta cagcggcact 2820

aacagcgttt tagtgttcaa tcagaccacc ccttttcttg ctggggcgaa tcccacttcc 2880

aatagcgtgg tgggttttgg gaaaacttca ggggctgaat gggggctagt gggctatatt 2940

caaggcgttt ttaaagccaa tcaaattgat attaccggca cgattcgctc tggtaatgga 3000

gccaaaaccg gtgggggcgc gactttagtg tttaacgctc aagagcgttt gaatatcgct 3060

aacgctaatt tgaataacga taaagccggt ttgcaaaact catggatgaa tttcattgtt 3120

aataatggca atttgaacgt aacaaacgca aattttagca accaaacccc gcatggaggc 3180

tttaacctta aagccaataa tatcacttgg gataaaggct ctgtgagtgg gggggggaat 3240

tttggtgtgg ataacgccaa cgctaatggg aatgcggtga ttaagaatgt taatttcagc 3300

gataacggca ctttgattta taaagggggt gaaaacagcg ccggaaattc cctgacctta 3360

gaaaacaaca ccttcaattc ctataatatc aacgccaaag cgcaaaacct tattttcaac 3420

aacaactcgt ttaacagcgg tagctattcg tttaacgaca ctaaaaatgt tacttttaaa 3480

ggcacgaaca cgctcattaa cagcgatcct ttcagccgcc ttaaaggatc agtttctatt 3540

gacaataata gtatctttaa cattgaaagg gatttgaccg ataaaaccac ttacacgctt 3600

ttaagtgggg ataacatcaa atacaataac caagctttag cggataatgt tttttcaaaa 3660

aatttatggg atttgatcca ttatgacggc gaacaaggga ctctattaag aacggataac 3720

aacacttatt ttgtgcaatt cacgcagagc aacggccaaa aatttgtttt tgaagagact 3780

tttaatcctg gctctatcac ctataaatat ttcactatcc attcttcgcc tttccacaca 3840

gaagctgatt ctaaggatat ttggaatcag gtgaggaagc agtttgattt tattccagga 3900

aaaaccccgg tgtgcgttgg cgtgtgctat atcgcaccct ataaaaatca agatcttatc 3960

ggctctagcg cttttgcgtg gtcgctgaat tttggggcta cggtggtggg gactttgctt 4020

ttagggagcg ctcaagaaaa agccaataac aatggcggct cgatctggtt tggtaagaat 4080

aatttgctgt atttgcatgg caattttaac gcgactaata tctttttaac gaataacttt 4140

aatgtcggca accctaacgc cggcggtggg gcaacgatta attttaacgc tgatgaaacc 4200

ttgagcgctg acgggttgaa ttacacgaat ttccaaaccg tggctatggg cttacaaact 4260

agcgcgagcc agcattcatg ggcgaatttt aattccaagc tttctatgga gattaaaaac 4320

tccaacttta gggatttcac atggggaggc tttaggttca attcagggcg tatcactttt 4380

gaaaacacca cttttagcgg ctggactaat attaacgggg cgactgaaag cggttcatcg 4440

tatgtgaaca tggttgcgaa tacggatttg attttcactg attccatttt aggagggggc 4500

attcgctatg atttgaaggc taataacatt attttcaata acactcaaat ggtggttgat 4560

gtgtctaaaa acgtgaatca gtcttcattg aatgggaatg ttactttcaa tcattccagg 4620

ctttcagtca aacccaatgc ggctatcaat attggggggg atcagaccca aacgacttta 4680

gaaaacgctt caagcctttc tttttataac gatagcgtag cgaattttaa cggcacgacc 4740
gcttttaacg gggtgtctta cttgaattta aaccctaacg ctcaagtcag cttcaatcaa 4800

gcgaatttca ataacgctaa tgtaaccttt tatggcattc cgctatttgg taaaacgccc 4860

aattttggca actctgtgcg ccttatcaat ttcaaagggg acgcaaagtt taatcaagcc 4920

acgctcaatt taagggctaa aaatatccat ttgaatttcc aaggggcttc cacttttgaa 4980

aataactcta cgatgaattt ggctgaaagt tctcaagcga gctttaacgc tcttagcgtg 5040

gagggggaaa cgaatttcaa tctcaacggc tcaagtttat tgagtttcaa tggtaacagc 5100

gtttttaacg cccctgtgaa tttctacgct aataattctc aaatttcttt cactcattcg 5160

gcgactttta atgcagacgc ttcatttgat ttaggcaata acagcaccct gaattttcaa 5220

agcgttcttt taaacagcgc tctaaacctt ttaggcaatg gcggtaacaa tctagcgatt 5280

aatgctaaag ggaattttag ttttggatct caagggattt tgaatctgtc ttatatgaat 5340

ctatttggag gggataaaaa agcttccgtt tatgatgtgt tgcaagccca aaatattgat 5400

ggcttaaggg ggaataacgg ctatgagaag atccgttttt atggcataca gattgaaaag 5460

gccgattact cgttcaataa tggcgttcat tcttggagct tcactaaccc gctcaacacg 5520

actgaaacca ttaccgaaac cttacataat aaccgcttga aagtgcagat ctctcaaaac 5580

ggtgcttcta ataatgcgat gtttaatctc gctcctagct tgtatgatta ccaacaaaac 5640

ccttatgatg aaagcgagaa ttcctataat cacacaagcg ataaagccgg cacttattat 5700

ttgagtagca gtatcaaagg ctttggtaag aataatgaaa tacccgggac ttataacgcg 5760

caaaaccaac ccttacaagc tttacacatt tataatcagg ctatcagtaa gcaggatttg 5820

aacatgatcg ccagtttggg taaggaattt ttgcctaaag tggctaaact tatcgcttca 5880

ggggctttag acaatctcaa tctcaatagc ccggatagct ttgaaacgat ttttagtatc 5940

ttaaaagaat atggcattac tttaaaccaa gcgaattgga agagcttatt gaagatcatc 6000

aataattttt ctaacacggc taattatcat ttttctcaag gtagtctcgt tgtgggggcg 6060

atcaaagaag ggcaaacgaa cacgaatagc gtggtgtggt ttgggggcga tgggtataaa 6120

aatccatgcg cggttgggga taacacttgc cagatgttca ggcagactaa tttagggcag 6180

ttgcttaatt ctagcgtgcc ttatttgggt tacattaacg ctaattttaa ggctaaaaac 6240

atttatatca ccggaaccat cggcagtggg aacgcttggg ggagcggagg gagcgcgaat 6300

gtgtcttttg aaagcgcgac aaatttggtg cttaatcaag ccaatattga cgctcaaggg 6360

accgataaga tcttttctta cttgggcaaa gagggcattg ataagctttt tggagaaaaa 6420

ggtttaggga atgtgctttc taatattgtt tatgaagaga gcttgaatga taacgctatc 6480

cctaaagatt tagccaacat gatccctaaa gatttgggat ctaaaacttt aagctctttg 6540






cttagcccta ctgaagtgaa taacctctta ggcgtgagtg cttttaaaaa cgcgatcatg 6600

gaaatcttaa attctaaaac ggtgggcgat gtttttggtg aaaacgggct tttaaacgcg 6660

ctagatcctg taaaaagaaa agaaattgat caaatgcttt tagagcaaat ccaagcccat 6720

tcttcagggt ttgaaaaatt catcgttaaa actttaggga ttgaaaatgt agagaatttc 6780 

atcaataact ggtatggcaa gcaaagcttg agttcttttg ccaataattt tgtgcctgga 6840 

ggcttgaatc aagccctaga taaaataggt tctagctctg atgccaaaga cttacagagt 6900 

ttcttagata aaacgacttt tggggatatt ctcaatcaaa tgatcaatca agccccttta 6960 

atcaataagc tcatttcttg gctcggcccg caggatttga gcgtgttagt gaatatcgct 7020 

ttaaatagta tcactaaccc tagtaaggaa ttattgggtg cgatttctgg catgggtcaa 7080

aaagtgctga acgatttgct aggcgaaggt gtagtgaata aaatcatgag caatcaagtt 7140 

ttagggcaaa tgatcaataa aatcatcgct gataagggct ttggaggcgt ttatcatcaa 7200

ggtctaggct caatactccc taaatcctta caagatgagt tgaagaaatt gggtatgggc 7260 

tctttactca aacctaaagg cttgcacaac ctctggcaaa aagggaactt caatttcgtg 7320

gctaaaaacc atgtgtttgt gaataacagc ttgtttagta acgccacagg gggggaattg 7380

aattttgtag cgggcaagtc cattattttt aatgggaaaa acaccattaa tttcacgcag 7440 

tatcagggca ggctttcttt tgtatctaaa gatttttcta atatttcatt agacacttta 7500 

aacgctacca acggcttaac gcttaacgct tctaaaaatg atattagcgt tcaaaaaggt 7560 

caaatttgcg tgaatgtctt agattgcatg accgctaaag ggaaaaccac tcaaactaat 7620 

tcctcttcaa gcgcgacagc cccaactaat gaaacgctag aagtgagtgc gaataatttc 7680 

gctttcttag gcaccattaa ggctaatgga ttagtggatt tttcaaaagt cttacaaaat 7740 

acgactatcg ggactttgga tttagggcca aacgctactt ttaaagcgaa taatttgatc 7800

gtgaataacg cttttaataa taactctaat tacagggcta atatcagcgg taatttcaat 7860 

gtggccaagg gtgcaacttt tagcacgaat gaaaatggtt tgaatgtggg ggggaatttt 7920 

aacagcgaag ggccattaat ctttaatctt aataacccca cccatcaaac gattatcaat 7980 

gtaactggca cttctacaat catgtcttat aacaatcaag ctttaatcaa ctttaacacc 8040 

caactcaagc aaggcgctta tacgcttatt aacgctaatc gcatggttta tggctatgat 8100 

aatcaaacga ttcttggggg gagcttgagc gattacctca aactttacac tctcattgat 8160 

tttaacggca aacgcatgca attgaatggc gattcgttaa gctatgacaa ccaaccggtc 8220

agtattaaag atgggggtct tgtggtaagc tttaaagaca atcaagggca aatggtgtat 8280 

tcatctatcc tttatgataa aatccaagtt accgtctctg ataaacccat gagcattcaa 8340 

gcccctagtt tggagtatta tgttaaacgc attcaaggta gtgctggttt gaatgcgatc 8400 

aaatctgcgg gcaataattc cattatgtgg ttgagtgagc tttttgcggc taaggggggt 8460 
aatcccttgt tcgcccctta ttatttgcaa gacaatccca ctgaacacat tgttacttta 8520

atgaaagata ttaccagcgc tttaggaatg ctctctaact ccaatctcaa aaacaactcc 8580 

actgatgttt tacagctcaa cacttacacg caacaaatga gccgtttagc caagctttct 8640

aatttcgctt cttttgattc aacggatttt agcgaacgct tgagcagtct taaaaaccaa 8700

agatttgccg atgctgtccc taatgcgatg gatgtgattt taaaatactc tcaaagggat 8760 

aaactaaaaa acaacctttg ggcgaccggt gttgggggcg tgagctttgt ggaaaatggc 8820 

acaggaacgc tctatggtgt caatgtgggc tatgatcgct ttgttagagg ggtaattgtt 8880 

ggagggtatg cggcttatgg gtatagcggg ttttatgagc gcatcactag ttctaaatcc 8940

gataatgtgg acgtgggttt gtatgctagg gcttttatca aaaagagcga gctgactttt 9000

agcgttaatg aaacttgggg ggctaataaa acccaaatca gctccaacga cgctttgctt 9060 

tctatgatca atcagtccta taaatacagc acatggacaa cgaacgcgaa agttaattac 9120

gggtatgatt tcatgtttaa aaacaaaagc atcattttaa aacctcaaat tggtttaagg 9180 

tattactata ttggtatgag cggtttagaa ggggtgatga ataacgtgct ctataaccag 9240 

tttaaagcga acgccgatcc gtctaaaaaa tccgttttaa cgattgattt tgctttggag 9300

aaccgccatt atttcaacac aaactcttat ttttatgcga ttggtggcgt tggtagagac 9360

ttgttagtta attctatggg ggataaattg gtgcgtttta ttggtaacaa cactttgagt 9420 

tacaggaaag gcgatcttta taacactttt gcgaacatca ctacaggcgg ggaagtgagg 9480

ttgtttaaaa gcttttatgc gaacgctggg gtgggggcta ggtttggatt ggattacaaa 9540 

atgatagata ttataggaaa tattggaatg cgtttagcgt tttaa 9585 
<212> Type : DNA 
<211> Length : 9585 

SequenceName : SEQ ID 474 

SequenceDescription :

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString : 

atgaaacaat ttaaaaagaa accaaaaaag ataaaacgat cgcatcaaaa tcaaaaaaca 60 

atcttaaagc gtcctttatg gcttatgcct ttactgattg gcgggtttgc tagtggggtg 120 

tatgcggatg gaacagacat tttggggctt agttgggggg aaaaaagcca aaaggtatgc 180 

gtgcatcgtc catggtatgc tatatggagt tgcgataaat gggaggaaaa aacacaacaa 240 

tttacaggaa accaactcat cacaaaaact tgggcagggg gtaatgcggc taactactac 300

cactctcaaa acaaccaaga catcacagcc aatttaaaaa atgataacgg cacttatttt 360

ttaagcggtc tgtataacta caccggaggg gaatataatg gggggaattt agacattgaa 420 
ttaggcagta acgctacttt taatctaggt gcgagtagtg ggaatagctt cacttcttgg 480

tatcctaatg ggcatactga tgttactttt agcgctggga ctatcaatgt gaataacagc 540

gtagaagtgg gcaatcgtgt gggatcggga gctggcacgc acaccggcac agccacttta 600

aacttgaacg ctaataaggt tactatcaat tccaatatca gcgcgtataa aacttcgcaa 660

gtgaatgtag gcaatgctaa cagcgttatt accattaatt cggtttcttt aaatggggat 720

acttgcagtt ctttagctag ggtgggcgta ggggctaatt gctccacttc tgggcctagc 780

tattctttta aagggacgac taacgctact aacacgactt ttagcaattc aagcggcagt 840

ttcacttttg aagagaacgc cacttttagc ggggcgaaat taaatggggg ggcattcact 900

ttcaataaaa agtttaacgc taccaataat accgctttta atagcggtag ttttactttt 960

aaaggcacaa gctcttttaa tggtgcgaat tttagtaacg cttcctatac ttttaataat 1020

caagccactt tccaaaacag ctcctttaat ggggggactt ttacttttaa tgaccagacc 1080

aatcaaagca cccagcaccc ccaaattcaa aacagctctt ttagcggcag tgctaccact 1140

cttaagggtt ttgcgacttt tgagcaagcc tttaacaatt caaaccacca actaacgata 1200

caaaacgctt cctttaataa cgctactttc aacaataccg gtaaaatcac tatagaaaaa 1260

gatgcgagct ttaataacac ttcgttcaac actcctgttg atacaaacaa catgactatt 1320

agtggtggcg ttactttaag cggtaaaaat gacttgaaaa atggtgcaac ccttgatttt 1380

gggagttcta aaatcactct cactcaaggg acgactttca acctcacaag tttaggcagt 1440

gagaagagcg taacgatttt aaattctaga ggtgggatca cttacaatca tcttttaaac 1500

catgcgatca atagcttgac aaacgcccta aaaacgaacg aaagctcttc aaaaccgcaa 1560

agtttcgctc aaggtttgtg ggatatgatc acttacaatg gggttaccgg gcagcttttg 1620

aatgaaaacg ctgcaacatc taaacccact gactcttcgc cctctaaatc ctctacaaac 1680

tctacgcaag tctatcaagt gggttacaaa ataggggata ctatctacaa actgcaagaa 1740

actttcagcc acaattccat tattattcag gctttagaga gcgggactta cacgccaccc 1800

cctgtcatta acggctccaa atttgactta tccgcttcaa attatatcaa tgctgacatg 1860

ccttggtata accataaata ttatattcct aaatcccaaa attttacaga gagcgggact 1920

tattacttgc cgagcgttca aatatggggg agctacacta actcgtttaa acaaaccttt 1980

agcgcaagta atagcaatct ggtgattggg tataacgcaa catggactga tcacaatgtt 2040

tcttctagcg acacggtggc ttttggggac acttcaggga gcgctcttaa tgggcattgc 2100

gggccttggc cctattacca atgcacaggc acgactaacg gcacttatag cgcttatcat 2160

gtctatatca cagcgaatct gcgttctggc aatcgtatag gcaccggtgg ggcagccaat 2220

ctaatcttta atggggtaga tagtatcaat atcgctaacg ctaccatcac gcaacataac 2280

gccggggctt attcaagctc tatgactttt tccacgcaaa acatggacaa ttcgcagaat 2340

ttgaatggcc taaattctaa cggcaagctt ttggtgtatg gcacaacttt cactaaccaa 2400

gccaaagacg ggaaattcat tttcaatgca gggcaagcga cttttgaaaa caccaacttt 2460

aatggaggga gttaccaatt cagcggcgat agcttgaatt tttcaaataa caaccagttc 2520

aatagcggtt cgtttgagat tggcgcaaaa aatactattt ttaataacgc taattttaac 2580

aacagcactt cttttaattt caataattct agcgcgacca cttcgtttgt gggggatttc 2640

actaacgcta attcaaattt gcaaatcgct gggaacgctg tttttgggaa ctctactaat 2700

ggctctcaaa ataccgctaa ttttaataat accggctctg ttaatattgc agggaatgca 2760

acctttgata acgtggtatt taacagccct acgaacacga gcgtgaaagg gaaagttact 2820

ctcaataaca tcactttaaa aaacttgaac gctcctttgt cttttggcga tgggacgatt 2880

gtttttagcg ctcattcggt gattaatatt ggtgaagcta tcacaaatgg caaccctatc 2940

acccttgtaa gctcttctaa agcaattgaa tacaacgacg ctttcagtaa aaatctatgg 3000

cagctcatca actaccaagg gcatggggct agcagtgaaa agctcgtttc tagtgcgggt 3060

aatggcgtct atgatgtggt gtattctttc aacaaccaaa cctacaattt ccaagaggtt 3120

ttttcaccca acagcatttc tatccggcgt ttgggcgttg gcatggtgtt tgattatgtg 3180

gatatggaaa aatcggatcg tttgtattat caaaacgctc tcggttttat gacctacatg 3240

cctaatagct ataacaataa tttagggaat ttaaacaaca ccatttacta ttacgacaac 3300

agcattgact tttatgcgag cgggaaaact ctattcacta aagcggaatt ttctcaaacg 3360

ttcactgggc aaaacagcgc gatcgttttt ggggctaaaa atatatggac gagcgtaagc 3420

gatgcgccgc aatctaatgt gatcattcgc tttggggaca ataagggagc agggagtaat 3480

gatgcgagtg ggcattgctg gaatttgcaa tgcataggct ttatcacagg gcattatgaa 3540

gcgcaaaaga tttacatcac cggcagtatt gaaagcggga accgcatttc tagcggtggg 3600

ggcgcgagcc ttaattttaa cgggcttcaa ggcattcttt taacgaacgc gactttgtat 3660

aaccgcgccg ctggcacgca aagctcttct atgaattttg tttctaacag cgcgaacatt 3720

caggctcaaa actcctattt tatagacgat accgcacaaa ataaaggcaa ccctaatttt 3780

agtttcaacg ctttgaatct ggatttttct aacagctctt ttagaggcta tgtggggcaa 3840

acgcagtctg tttttaaatt caatgccgtt aatgcgatca gtttcactaa cagctctaat 3900

ttaagctctg gtttgtatca aatgcaagct aaaagcgtgt tgtttgacaa ttccaattta 3960

agcgtttcag tggggacaag cagcattaaa gccaatgcga tcaatctttc tcaaaacgcc 4020

tctatcaatg cgagcaacca ttcaacctta gaacttcaag gcgatttgaa tttgaacgac 4080

accagctcgc tcaacctcaa ccaaagcgcc attaatgttt ctaacaacgc cacgatcaac 4140

gattatgcga gcttgattgc gagtaatggc tctcacctta attttaacgg ggcggttaat 4200

ttcaattcag cgaatattac tacgagtttg agtagttcct ctatcgtgtt taagggggcg 4260

gtctctttac gagggcagtt taatttaagc aataattctt ctttagattt tcaaggctct 4320

agcgctatca cctctaacac ggcgtttaat ttctatgata acgctttttc tcaaagcccc 4380

atcactttcc atcaagccct tgacattaaa gtgcccttga gtttgggagg caacctctta 4440
aaccctaaca acagtagcgt gctgaattta aaaaacagcc agcttgtttt tagcgatcaa 4500

gggagcttga atatcgctaa cattgattta ctaagcgatc tgaatggtaa taaaaatcgt 4560

gtgtataaca tcattcaagc ggacatgaat ggtaattggt atgagcgtat caacttcttt 4620

ggcatgcgca ttaatgatgg gatttatgac gctaaaaacc aaacttatag tttcactaac 4680

cctctcaata acgccctaaa aatcaccgag agctttaaaa ataaccaact gagcgttacg 4740

ctctctcaaa tcccgggcat taaaaacacg ctctataaca ttggctctga aatctttaac 4800

taccaaaagg tttataacaa cgctaatggc gtgtattctt atagcgatga cgcacaaggc 4860

gtgttttatc tcacgagcag cgtgaaaggc tattacaacc ccaaccaatc ctatcaagcc 4920

agcggcagca ataacaccac gaaaaataac aatctaacct ctgaatcttc tgtcatttcg 4980

caaacctata acgcgcaagg caaccctatc agcgcgttac acgtctataa caagggctat 5040

aatttcagta atatcaaagc gttagggcaa atggcgctca aactctaccc tgaaatcaaa 5100

aagatattag ggaatgattt ttcgctttca agtttgagca atttaaaagg cgatgcgcta 5160

aaccagctta ccaagctcat cacgcctagc gattggaaaa acattaacga gttgattgat 5220

aacgcaaaca attcggtcgt gcaaaatttc aataacggca ctttgattat aggagcgact 5280

aaaatagggc aaacagacac caatagtgcg gtggtttttg ggggcttggg ctatcaaaag 5340

ccttgcgatt acactgatat tgtgtgccaa aaatttagag gcacttattt ggggcagctt 5400

ttggagtcca tctcggctga tttgggctat attgacacga cttttaacgc taaagaaatt 5460

tatcttaccg gcactttagg gagcgggaac gcatggggga ctggggggag tgcgagcgta 5520

acttttaaca gccaaacttc gctcattctc aaccaagcga atatcgtaag ctcgcaaacc 5580

gatgggattt ttagcatgct gggtcaagag ggcatcaata aggttttcaa tcaagccggg 5640

ctcgctaata ttttgggcga agtggcaatg caatccatta acaaagccgg gggattaggg 5700

aatttgatag taaatacgct agggagtgat agcgtgattg gggggtattt aacgcctgag 5760

caaaaaaatc aaaccctaag ccagcttttg gggcagaata attttgataa cctcatgaac 5820

gatagcggtt tgaacacggc gattaaggat ttgatcagac aaaaattagg cttttggacc 5880

gggctagtgg ggggattagc cggactgggg ggcattgatt tgcaaaaccc tgaaaagctt 5940

ataggcagca tgtccatcaa tgatttattg agtaaaaagg ggttgttcaa tcagatcacc 6000

ggctttattt ccgctaacga tatagggcaa gtcataagcg tgatgctgca agatattgtc 6060

aagccgagcg acgctttaaa aaacgatgta gccgctttgg gcaagcaaat gattggcgaa 6120

tttttaggcc aagacacgct caattcttta gaaagcttgc tgcaaaacca gcagattaaa 6180

agcgttttag acaaagtctt agcggctaaa ggattagggt ctatttatga acaaggtttg 6240

ggggatttga tccctaatct tggtaaaaag gggattttcg ctccctatgg cttgagtcaa 6300

gtgtggcaaa aaggggattt tagtttcaac gcgcaaggca atgtttttgt gcaaaattcc 6360

actttctcta acgctaatgg aggcacgctc agttttaacg caggaaattc gctcattttt 6420

gccggaaaca accacatcgc tttcactaac cattctggaa cgctcaattt gttgtctaat 6480

caagtttcta acattaacgt caccatgctt aacgctagca acggccttaa gattaacgcc 6540

actaataaca atgtttccgt gtctcaaggc aatctgttta tcaacgctag ctgcgtgcaa 6600

caaagcgatc caacgacagc tagcgccaca aacccttgca ccaccgctca aaataacgct 6660

tcttctagta atgcgtcaaa caacgcgcca atcgccttaa ataataacga tgaaagcttg 6720

gtggttacgg cgaatggttt caatttttca ggcaatattt acgctaacgg ggtggttgat 6780

ttttcaaaaa ttaaaggctc tgcaaacgtt aaaaacctgt atctttacaa taacgctcaa 6840

ttccaagcca acaacctcac gatttccaac caagcggtat tagagaaaaa cgctagcttt 6900

gtaacgaata acttaaacat tcaaggagcg tttaacaaca acgccacgca aaaaatagag 6960

gtgcttcaaa atttagtgat cgcttcaaac gcttctttaa gcaccgggat ttatgggtta 7020

gaagtagggg gggcattgaa taatttggga gcgatccatt ttaatttaga aaattctcaa 7080

acgcctgtaa atccgctcat tcaagtaggg gggatcatta atctcaacac cacccaaacg 7140

ccttttatga atgtcagcgt ggctaatggc ggaacttaca ctttattaaa aagcagccgt 7200

tatattgatt acaatatcaa ccctaacagc ttgcaatcgt atttgaagct ctatacctta 7260

atcaatatca acggaaacca catagaggaa aaaaacggcg tattgactta tttgggccaa 7320

cgggttttat tacaagataa ggggttatta ttgagtgtag cactacctaa ctcaaacaac 7380

gcctctcaaa acaacatttt aagcctttct gtccttcaca accagattaa aatgtcttat 7440

ggtaataaag tgatggactt tacccctccc accttacagg attacattgt gggcattcaa 7500

ggacaaagcg cactcaatca aattgaagct gttgggggga ataacgctat caagtggctt 7560

tcaacattga tgatggagac taaagaaaac ccgctttttg cgccgattta tttagaaaac 7620

cactctttaa atgaaatctt aggcgtaaca aaagatcttc aaaacaccgc aagcttgatt 7680

tctaacccta attttagaaa taacgctacc agccttttag aaatggcgag ttacacccaa 7740

caaaccagcc gtttgacaaa actctctgat tttagggcta gagagggaga gtccaatttt 7800

tcagagcgct tgttagagct taaaaacaag cgttttagcg atcctaaccc tagtgaggtt 7860

tttgtcaaat actctcaact cagcaaacac cccaataacc tttggattca aggggtggga 7920

ggagcgagct ttatttctgg gggcaatggc acgctttatg gcttgaatgt gggctatgac 7980

cgattggtta aaagcgtgat ccttgggggt tatgtggctt atggctatag cggttttaac 8040

gggaacatca tgcattcttt ggctaataat gtggatgtgg ggatgtatgc gagggctttt 8100

ttgaaaagaa acgaattcac tttgagcgcg aatgaaactt atggaggcaa tgcgagtcat 8160

atcaattctt ctaattcctt gctctctgtg ttgaaccaac gctacaacta caacacctgg 8220

acaacgagcg tgaatgggaa ttacggctat gatttcatgt tcaaacaaaa aagcgtggtg 8280

ctaaaacctc aagtgggctt gagctatcat ttcataggct tgagcgggat gaaaggtaaa 8340

atgcaaaatc cagcttacca acaattcgtc atgcattcaa acccttctaa cgaatcggtt 8400

ttaacgctca acatggggtt agagagccgt aaatattttg gtaaaaattc ctattatttt 8460
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gtaacggcga ggttgggtag ggatcttttg 

tttgtgggtg aaaacacttt attgtaccgc 

gtgatcacag gaggcgaaat gcatttgtgg 

cttaaaatgg gcttgcaata ccaagatctt 

gcgttttag 

<212> Type : DNA 

<211> Length : 8709 

SequenceName : SEQ ID 475 
SequenceDescription : 

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 
<4 0Q> PreSequenceString ... 

atggcgttta aaaaggccag attgatttcc aggtttattt caaaaggatc tttcaaattg 60 

aataagatct caaagaaatt tfctcacattg aatcaaatct taaagcgtga aaagccctta 12 0 

aaacgccata aaaaaacaaa atctattgaa aagcccttta ataaaaacaa atctttttta 180 

aaagcttcgg ttttattgat aggagcgcta ggggggttat cccacctaag ggctaacgaa 240 

tgccgttatt ggtcatggtc gtcttggagt tatcaagaca atattgaaag cggtcctaat 3 00 

tcacccacgc acaactctta ttgtcttttt agtagcgctc aaggctctgg gacttattat 3 60 

ttaaacactc ttaccactta tagcgctggt ggggctagtt tcacgcaaaa attcaatggt 420 

ggcacgcttg atataggggg gaatatccgc tttggaggca caggtattaa tggaggtgat 48 0 

gtagggtata tcactggaac ttataatgct caaacgatga attttaattc tagccatatc 540 

acaaccggaa actcatacgc tgatggcggt gggaccacgc tcaattttaa cgcgactaac 600 

aatatcacta tcaatcaagc gagctttgat aacagcgatg cagggacaca aaaatcttac 660 

atgaatttta aaggctctaa tatcaagatc agtggctcta gctttacaga cgacaccaat 720 

ggaggtttta atttcagcgg taataacaat aatagcacca tctctttcaa tcaaaccagc 780 

ttcaatcaag ggacttataa ttttagtaac agcgccactt taagcttcaa taacagcaat 84 0 

ttcaatcaag ggacttatca ctttaacagc gcccaatcca cttttgaaaa cagcaatttc 900 

aatcaaggca cttataattt taatgacaat actagcttta ataacgacac cttcaatcaa 960 

ggcacttata attttaatag cagcaaggtg agtttttcag gcgctaacac tttaaattca 1020 

agttcgcctt ttgctagcct taaaggcagt gtgtctttta attctggtgc gatttttaac 1080 

ctcaatcaaa cccttaataa taatcaaacc tatgacattc tcactacaaa cggagcgatc 1140 

cagtatgggg tttatcaaag ctatttgtgg gatctaatca actataaggg cgataaagcc 1200 

attagccatg tfcgaagtgag taataacact tatgatgtaa cctfctgacat taacgggcaa 1260 

gatgaaacct tacaagaaac ctttagcaac caatctatta ttacccaatt tttaggagac 1320 

gatttacaac aacaagccca acaaacctat caagaggatg tagctaattc ccagaacgct 13 80 

ttgaataagg ttgctagcga caacacgatc gcaaataacg atacaagcta cactcaaagc 1440 

agtaacccca ctatccttaa agacgctcaa ggtttagaaa acaccaacca acaaatccaa 1500 

caagacgaaa aagccttaga aaaagattta gcccaaatca agcaattagc caactccacc 1560 

acaggcttta acgaacaagc tttcactcaa gctcaaaaac aagaacaaca agatgaacaa 1620 

gccttacaga acgatgaaaa cgcttttaat acggaacaag agggattaga acaagcgata 1680 

gctaacgcta aacatgccaa ccccacacca aatccgacac caagccccac acccactcct 1740 

ataaaacaca cagcgccaaa cactccccct agtcaagtcc cgcccacacc ccctagtcaa 18 00 

aatttaccta aaacaaatgt gtggaatggg gtttattggc ttcaaaacaa aacttactca 1860 

aacaaaggca tttattatat tgatcccaat ctttcaggac agagcggtca aagcggcaac 192 0 

acgctcagca cctatacagc taatttgtta gggagaagtt ttggcgtcaa tgctaacaat 1980 

ggcactttga tcatagggaa taatacagag agtgtgaatg ataacgggtt gatttggata 2 040 

gggcatggag gctttggcta tattacggga acttttagtg cggctaacat ttacttgacc 2100 

aataatttta aaaccggtga aggcgtttca aattcagatg gtgggggagc gaacattacc 2160 

tttaaagcaa gcgataatat cactatggat ggcttgaatt acaataacgc tgaaaccgtt 222 0 

actaaaatga ttcaaacagg ggccagtcag cattcctata ccacttttga cgctaccaat 228 0 

aatatcagtg taactgattc tgattttagc gatatgactt gggggaaatt cagttttagc 2340 

gctaagaata tttcgttttc taacgcttcg ttcagcggct ttacaaaccc tggaggatca 24 0 0 

agcactatca gcacgaatgc ttctaattct ttaagcttta cagattctcg cttgaatggt 2460 

ggagcaatct ataatttaca ggctaatagc cttattttca ataacacgca agcggttttt 2 520 

aatgtcttgt attctagggg gacaagcaat tttaacgcca ccacacagct, tttaggcaac 2580 

acgagtttta cgcttagctc tcaaagtttg ctaaacttta atggcgatac aaccttgcaa 2640 

aacaacgcta atatcacgct tggcaataaa agtcaagccg cttttaaaaa ttcfcttaacg 2700 

cttgataaca attctaattt aagcttagac aatcaaagcg ttttgaacgc gaatggcacg 2760 

agtgctttta acaatcaagc gagtctcaac atttataatg ggagtcaagc ggcctttagc 2820

agtctctttt ttaatggcgg aacactcagt cttaacgcga atagcaagct caacgcttct 2880 

agcgctagtt tttcaaacaa caccactatt aatttagacg atagcgtttt gaatgcgaat 2940 

aacacaagct ctttaaacgc taatatcaat tttcaaggcg caagccaggc tgattttgga 3000

ggcaacacga ctattgatac agcaagcttt aattttgaca gcgcaagttc attgaatttt 3060 

aataacctta cggctaatgg ggcgttaaat tttaatggtt atgcgccctc tttaactaag 3120 

gctttaatga atgtcagcgg gcagtttgtt ttagggaata atggggatat taatttatct 3180 



atcaaagcta aaggcgacaa 
aagggggaaa tttttaacac 
cgtttgatgt atgtgaatgc 
aatatcactg ggaatgtggg 



tgtggtgcgt 8520
ttttgcgagc 8580
gggggtgggg 8640
catgcgagtg 8700

8709 
gacatcaata tctttgacaa catcacaaaa tctgtaactt acaacatctt aaacgctcaa 3240

aaagggatta ctggcattag tggggctaat ggctatgaaa aaatcctttt ttatggcatg 3300

aaaatccaaa acgctaccta tagcgataat aacaacatcc aaacttggtc gtttataaac 3360

cctctcaatt cttctcaaat cattcaagag agcattaaaa atggggatct aaccatagaa 3420 

gttttaaata accctaactc ggcttccaac actattttta atatcgctcc tgagctttat 3480

aattaccaag attctaagca aaatcctacc ggctatagct atgattatag cgacaatcaa 3540

gcaggcactt attacttgac aagcaacatt aaaggtcttt tcacccctaa aggctctcaa 3600 

acgcctcaaa ccccaggcac ttatagccca tttaaccagc ctttgaatag tttgaatatc 3660 

tacaataagg gtttttctag cgagaattta aaaacgcttt tagggatcct ttctcaaaat 3720 

tccgccacct taaaagaaat gattgaatcc aaccaactag acaatatcac taacattaat 3780

gaagtgttgc aactcttaga taagattaaa atcacccaag cgcaaaagca agcgctccta 3840

gaaacgatca accatttgac tgacaacatc aatcaaacct ttaataacgg gaatctcgtt 3900

ataggcgcta cccaagataa tgttacaaac tctactagct ctatatggtt tgggggcaat 3960

ggctatagca gcccttgcgc gctagatagc gccacttgtt cttcttttag aaacacttac 4020

ttggggcaat tattaggctc aacttcccct tatttaggct acattaacgc tgattttaaa 4080

gctaaaagca tttatattac cgggacaatt ggaagtagta acgcttttga aagcggaggg 4140 

agcgcggatg taacctttca aagcgctaat aacttagtgt tgaataaagc taacatagaa 4200 

gctcaagcca cagacaatat ctttaatctt ttgggtcaag aagggattga taaaatcttt 4260

aatcagggga atttagcgaa tgttcttagt caaatggcta tggaaaaaat caagcaagcc 4320 

ggcggtttag ggaactttat agaaaacgct ctaagccctt tgagtaagga attacccgct 4380

agcttgcaag atgaaacctt aggccaactt ataggtcaaa ataacttaga tgatttattg 4440 

aataatagtg gagtcatgaa tgaaatccaa aacattatca gtcaaaaact aagcattttt 4500

ggcaattttg ttaccccatc catcatagaa aactaccttg ctaagcagtc tttaaaaagc 4560 

atgctagacg ataaagggct tttgaatttt atcggtgggt atatagacgc ttctgaatta 4620 

agctctattt taggcgtgat tttaaaggat attactaacc cccctacaag cctgcaaaaa 4680

gacattggtg tggtagcgaa cgacttgttg aacgagtttt taggacaaga tgttgtcaaa 4740 

aagctagaaa gtcaaggctt ggtgagtaat atcatcaata atgttatttc tcaaggcggg 4800

ttgagcggcg tttataatca aggtttaggg agcgtgttgc cgccctcttt acaaaacgcg 4860 

ctcaaagaaa acgatttagg cactctttta tcgcctagag gcttgcatga tttttggcaa 4920 

aaagggtatt ttaacttttt aagcaatggc tatgtttttg tcaataacag ctcttttagt 4980

aacgctactg ggggtagttt gaattttgtc gccaacaagt ctattatctt taatggcgat 5040 

aatacgattg actttagcaa gtatcaaggc gcattgattt ttgcttctaa tggtgtttct 5100 

aatatcaata tcaccaccct aaacgccact aatggcttaa gccttaatgc gggtttgaat 5160 

aatgtgagcg ttcaaaaagg agaaatttgt atcaatttag ccaattgccc tacaaccaaa 5220 

aacagctctc ctgcaaactc tagcgtaacc cccactaatg agtctttaag cgtgcacgct 5280

aataatttca ctttcttagg cacaatcatc tctaatgggg ctattgattt gtctcaagta 5340

acaaataata gcgttatagg cacgctcaat ctcaatgaaa atgcgacctt gcaagctaat 5400 

aatttaacga tcaccaacgc ttttaacaac gcctctaact ctacggctaa tattgatggt 5460 

aatttcacct taaaccaaca agcgacttta agcactaacg ctagtggttt gaatgtcatg 5520 

gggaatttta atagctatgg cgatttggtg tttaacctca gtcattcagt tagtcatgct 5580

attatcaata ctcaaggcac agcgacgatc atggccaata ataacccttt gatccaattc 5640

aacgcttctt caaaagaagt gggtacttac acgctgattg atagcgctaa agccatttat 5700

tacgggtata acaaccaaat cacaggaggc agtagcctgg ataattaccfc taagctttat 5760 

gcgctcattg atattaatgg caagcacatg gtgatgactg acaacggctt aacctataac 5820 

gggcaagccg tgagcgttaa agatggcggt ttagttgtag gctttaagga ctctcaaaat 5880

caatacattt acacttccat tctttataat aaagtgaaaa tcgctgtttc taatgatcct 5940 

atcaataacc cacaagcccc cactttaaaa caatatatcg ctcaaattca gggcgttcaa 6000

agcgtggata gcatcgatca agctggggga aatcaagcga ttaattggct caataaaatc 6060 

tttgaaacta aaggaagccc tttattcgct ccctattatc tagagagcca ctccacaaaa 6120 

gatttaacca cgatcgctgg agatattgct aacactttag aagtcatcgc taaccctaat 6180

tttaaaaatg acgccactaa tattttacag atcaacacct acacgcagca aatgagtcgt 6240 

ttagccaagc tctctgacac ttcaactttc gcccgttctg atttcttaga acgcttagaa 6300

gcccttaaaa acaagcgatt cgctgatgcg atccctaacg ctatggatgt gattttaaaa 6360

tactctcaaa ggaatagagt taaaaataat gtgtgggcga caggagttgg aggggctagt 6420 

ttcattagtg gaggtactgg aactttatat ggtatcaatg tagggtatga taggtttatt 6480

aagggcgtga ttgtgggagg ttatgccgct tatgggtata gcgggtfccca tgcaaacatc 6540

actcaatcag gctctagcaa tgtcaatgtg ggcgtttata gccgagcgtt tatcaaaaga 6600 

agcgagctaa ccatgagctt gaatgagact tggggataca ataaaacttt catcaactcc 6660 

tatgaccccc tactctcaat catcaatcag tcttacagat acgacacttg gacgactgac 6720 

gctaaaatca attatggcta tgafcttcatg tttaaagata aaagcgttat ttttaaaccc 6780

caagtaggct taagctatta ttacattggt ttgtctggtt fcaaggggcat tatggatgat 6840 

cctatttaca accaattcag agccaatgct gaccctaata aaaaatccgt tctaacgatc 6900

aattttgccc tagaaagtcg gcattatttc aataaaaact cttattattt tgtgattgcg 6960 

gatgtgggca gagacttatt cattaattct atgggggata aaatggtgcg tttcatcggt 7020 

aataacaccc taagctatag agatggtggc agatacaaca cttttgctag cattatcaca 7080

ggcggggaga taagattgtt caaaaccttt tatgtgaatg cgggcatagg ggctaggttt 7140 

gggcttgatt ataaagatat taatattacc ggaaatattg gtatgcgcta tgctttttaa 7200
<212> Type : DNA
<211> Length : 7200

SequenceName : SEQ ID 476
SequenceDescription :

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 

<400> PreSequenceString :

atggaaatac aacaaacaca ccgcaaaatc aatcgccctt tagtttctct cgttttagca 60 

ggagcgttga ttagcgccat accgcaagag agtcatgccg cctttttcac gaccgtgatc 120 

attccagcca ttgttggggg tatcgccaca ggcactgctg taggaacggt ctcagggctt 180 

cttagttggg gactcaaaca agccgaagaa gcgaataaaa ccccagataa acccgataaa 240

gtttggcgca ttcaagcagg aaaaggcttt aatgaatttc ctaacaagga atacgactta 300

tacaaatccc ttttatccag taagattgat ggaggttggg actgggggaa cgccgctagg 360

cattattggg tcaaaggcgg gcaatggaac aagcttgaag tggatatgaa agacgctgta 420 

gggacttata aactatcagg gcttagaaac tttactggtg gggatttaga cgtgaatatg 480 

caaaaagcca ctttgcgttt gggccaattc aatggcaatt ctttcacaag ctataaggat 540 

agcgctgatc gcaccacgag agtgaatttc aacgctaaaa atatttcaat tgataatttt 600

gtagaaatca ataatcgtgt gggttctgga gccgggagaa aagccagctc tacggttttg 660 

actttgcaag cttcagaagg gatcactagc agtaaaaatg cggaaatttc tctttatgat 720 

ggcgccacgc tcaatttggc ttcaaacagc gttaaattaa atggtaatgt gtggatgggc 780 

cgtttgcaat acgtgggagc gtatttagcc ccttcataca gcacgatcaa cacttcaaaa 840 

gttcaagggg aagtggattt taaccatctc actgtggggg atcaaaacgc cgctcaagcg 900
ggcattatcg ctagcaataa gactcatatt ggcacactgg atttgtggca aagcgccggg 960

ttaaatatca ttgcccctcc agaaggtggc tacaaggata aacctaatag taccacttct 1020 

caaagtggca ctaaaaacga caagaaagag atcagtcaaa ataacaatag caacacagag 1080 

gtcattaacc cacccaataa cacgcaaaaa acagaaactg aacccacgca agtcattgat 1140 

gggccttttg ctggcggcaa agacacggtt gtcaatattt tccacttaaa cactaaagcc 1200

gatggcacga ttaaagtggg agggtttaaa gcttctctta ccacgaatgc ggctcatttg 1260

aatatcggca aaggcggtgt caatctgtcc aatcaagcga gcgggcgcac ccttttagtg 1320 

gaaaatctaa ccgggaatat caccgttgat gggcctttaa gagtgaataa tcaagtgggt 1380

ggctatgctt tggcaggatc aagcgcgaat tttgagttta aggctggtgt ggatactaaa 1440 

aacggcacag ccactttcaa fcaacgatatt agtttgggaa gatttgtgaa tttaaaggtg 1500

gatgctcata cagctaattt taaaggtatt gatacgggta atggtggttt caacacctta 1560

gattttagtg gtgttacaga caaagtcaat atcaacaagc tcatcacagc ttccactaat 1620

gtggccgtta aaaacttcaa cattaatgaa ttgattgtta aaaccaatgg gataagtgtg 1680 

ggggaataca ctcattttag cgaagatata ggcagtcaat cgcgtatcaa taccgtgcgt 1740 

ttggaaactg gcactaggtc aatcttttct gggggtgtca aatttaaaag cggtgaaaaa 1800

ctagttatca atgattttta ctatagccct tggaattatt ttgacgctag gaatgttaaa 1860 

aatgttgaaa tcaccagaaa attcgcttct tcaaccccag aaaacccttg gggcacatca 1920 

aagctcatgt ttaataatct aaccttgggt caaaatgcgg tcatggacta tagtcaattt 1980

tcaaatttaa ccattcaggg ggattttatc aacaatcaag gcactatcaa ctatctggtc 2040 

cgaggcggga aagtggcaac cttaaatgta ggcaatgcag cagctatgat gtttaataat 2100

gatatagaca gcgcgaccgg attttacaaa ccgctcatca agattaacag cgctcaagat 2160 

ctcattaaaa atacagagca tgttttattg aaagcgaaaa tcattggtta tggtaatgtt 2220

tctacaggta ccaatggcat tagtaatgtt aatctagaag agcaattcaa agagcgccta 2280

gccctttata acaataataa ccgcatggat acttgtgtgg tgcgaaatac tgatgacatt 2340

aaagcatgcg gtatggctat cggcaatcaa agcatggtga acaaccctga caattacaag 2400

tatcttatcg gtaaggcatg gagaaatata ggcatcagta aaacggctaa cggctctaaa 2460 

atttcggtgt attatttagg caattctacg cctactgaga atggtggcaa taccaccaac 2520

ttacccacaa acaccactaa taatgcgcat tctgctaact acgctctcgt gaagaacgct 2580

cctttcgctc acagcgccac tcctaattta gtcgctatca atcagcatga ttttggcact 2640

attgagagcg tgtttgaatt ggctaaccgc tctaaagata ttgacacgct ctatactcat 2700

tcaggcgcgc aaggcaggga tctcttgcaa actttattga ttgatagcca tgatgcgggt 2760 

tatgccagac aaatgattga taacacaagc accggtgaaa tcaccaagca attgaatgcg 2820 

gccactgacg ctttaaacaa cgtagccagt ttagagcata aacaaagcgg cttacaaacc 2880

ttgagcttga gtaatgcgat gattttaaat tctcgtttag tcaatctctc taggaagcac 2940

accaaccata ttaactcgtt cgctcaacgc ttacaagctt taaaaggcca agaattcgct 3000

tctttagaga gcgcggcaga agtgttgtat caatttgccc ctaaatatga aaaacctacc 3060 

aatgtttggg ctaacgctat tgggggagcg agcttgaata gcggctctaa cgcttcattg 3120 

tatggcacaa gcgccggcgt agacgctttc cttaacggga atgtggaagc cattgtgggc 3180 

ggttttggaa gctatggtta tagctccttt agcaatcaag cgaactctct taactctggg 3240 

gccaataacg ctaattttgg cgtgtatagc cgtttttttg ccaaccagca tgaatttgac 3300

tttgaagctc aaggggcgct agggagcgat caatcaagct tgaatttcaa aagcactcta 3360

ttacaagatt tgaatcaaag ctataattac ttagcctata gcgccacagc aagagcgagt 3420 
tatggttatg acttcgcgtt ttttaggaac gctttagtgt taaaaccaag cgtgggcgtg 3480

agctataacc atttaggttc aaccaacttt aaaagcaata gccaatcaca agtggcttta 3540 

aaaaatggcg cgagcagtca gcatttattc aacgctaacg ctaacgtgga agcgcgttat 3600

tattatgggg acacttcata cttttatttg catgcgggag ttttacaaga gttcgctcac 3660

tttggatcga atgatgtggc gtctttaaac acctttaaaa tcaatgccgc tcgcagtcct 3720

ttaagcacct atgcaagagc gatgatgggt ggggaattgc aattggctaa agaagtgttt 3780

ttgaatttgg gcgtggttta tttgcacaat ttgatttcca acgcaagcca tttcgcttcc 3840

aatttaggaa tgaggtatag tttctaa 3867
<212> Type : DNA
<211> Length : 3867

SequenceName : SEQ ID 477

SequenceDescription :



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaaaaaac acatcctttc attaacttta ggatcgcttt tagtttccac tttgagcgct 60 

gaagacgacg gcttttacac aagcgtaggc tatcagatcg gtgaagccgc tcaaatggta 120 

acaaacacca aaggcatcca agatctttca gatcgttatg aaagtttgaa caaccttttg 180

aatagataca gcaccctaaa cacccttatc aaattgtccg ctgacccgag cgcgattaat 240 

gcggtgcggg aaaatctggg cgcgagcgcg aagaatttga tcggcgataa agccaattcc 300

ccggcgtatc aagccgtgct tttagcgatc aacgcggcgg tagggttttg gaatgtcgta 360

ggctacgtga cgcaatgcgg gggtaacgcc aatggtcaaa aaagcatctc ttcaaagacc 420

atcttcaaca acgagccagg gtatcgatcc acttccatca cttgttcttt gaacgggcat 480

tctcctggat actacggccc tatgagcatt gaaaatttca aaaagcttaa cgaagcctac 540 

cagatcctcc aaacggcttt aaaacgaggc ttgcccgcgc tcaaagaaaa caacgggaaa 600 

gtcaatgtaa cctatactta cacatgctca ggggacggga ataataactg ctcgtcacaa 660 

gtcacaggtg taaataatca aaaagacgga accaagacta aaatccaaac catagacggc 720

aaaagcgtaa ccaccacgat cagttcaaaa gtggttgata gtcgtgcaga tggtaataca 780

acaggggtgt cctacaccga aatcaccaac aaattagaag gtgtgcctga tagcgctcaa 840

gcgctcttag cgcaagcgag tacgctcatt aacaccatca acaacgcatg cccgtatttc 900

catgctagta atagtagtga ggctaacgcc ccaaaattct ctactactac tgggaaaata 960 

tgcggcgctt tttcagaaga aatcagcgcg atccaaaaga tgatcacgga cgcgcaagag 1020 

ctggtcaatc aaacgagcgt cattaacgag catgaacaaa caactccggt aggcaataac 1080

aatggcaagc ctttcaaccc tttcacggac gctagttttg cgcaaggcat gctcgctaac 1140 

gctagtgcac aagccaagat gctcaatcta gccgaacaag tggggcaagc cattaaccct 1200 

gagaggctta gcgggacttt tcaaaatttt gttaaaggct ttttagccac atgcaacaac 1260 

ccatcaaccg ctggtactgg tggcacgcaa ggttcagctc caggcacagt taccactcaa 1320

actttcgctt ccggttgcgc ctatgtagga caaacgataa caaatcttaa aaacagcatc 1380

gcccattttg gcactcaaga gcagcagata cagcaagccg aaaacatcgc tgacactctg 1440

gtgaatttca aatctagata cagcgaattg ggcaacactt ataacagcat caccactgcg 1500

ctctctaata tccctaacgc gcaaagcttg caaaatgcgg tgagtaaaaa gaataacccc 1560 

tatagcccgc aaggcataga caccaattac tacctcaatc aaaactctta caaccaaatc 1620

caaaccatca accaagaact cgggcgtaac ccctttagga aagtggggat tgttagttct 1680

caaaccaata atggcgcgat gaatgggatc ggtattcagg tgggttacaa acaattcttt 1740

ggccaaaaaa gaaaatgggg cgctaggtat tacggctttt ttgattacaa ccatgcgttc 1800

attaaatcca gcttcttcaa ctcggcttct gacgtgtgga cttatggctt tggagcggac 1860 

gctctttata atttcatcaa cgataaagcc accaatttct taggcaaaaa caacaagctt 1920

tctgtggggc tttttggggg tattgcatta gccgggactt catggcttaa ttctgagtat 1980

gtgaatttag ccaccatgaa taacgtctat aacgctaaaa tgaacgtggc gaatttccaa 2040 

ttcttattca acatgggagt gaggatgaat ttagccaggc ctaagaaaaa agacagcgat 2100 

catgcggctc agcatgggat tgagttaggg cttaaaatcc ccaccatcaa cacgaactac 2160 

tactccttta tgggggctga actcaaatac cgaaggctct atagcgtgta tttgaattac 2220 

gtgttcgctt attaa 2235
<212> Type : DNA 
<211> Length : 2235 

SequenceName : SEQ ID 478 
SequenceDescription : 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgataaaaa aagctaaaaa attcatacca ttctttttaa ttggctccct cttagctgaa 60

gacaatggct ggtatatgtc tgtaggctat caaatcggtg gcacgcagca attcatcaat 120

aacaaacaac ttttagaaaa tcaaaatatc atcaatagca tcactcaaag cgcgatcaac 180
attgcagggc ctactaccgg ccttatcact ttaagctctc aaaccgtcat tgacgcttta 240

ggctatggcg tgagtaacac tgttggcaac caattagagg gcatttctaa catcttgaat 300

caaattggca aaagaaaaga cttttattct agccgtcaaa tctctagcat ttcccagcaa 360

atcatagggc ttaaaggaag ctctgatccc ttaaaagccc attcttcaca aatcacagcc 420

aaactccttt ccaacaccca aagcgcgttt gatcagggca tcgctctaag ctctaatatc 480

attagtgcag tcaatagcct aaaccctagc aacaactccc aagaagtcaa agcccagctc 540

caaaacaccg cgcaatccat ggcggaatta ttgcaacaaa ttgaacacag catcactaaa 600

accactagca ccacttacgc acaatcctta ctctccaatc tgaccgatgc ggtgaatgca 660

tctagcaata ataccactta tgtgagcgct cttgttaacg ctttaaacac tttaggggtg 720

ggggttttcc ccaccacaac ctcaacgcat gtggtgctaa acccaccggg acaagtcgta 780

ttctatccaa ctaattccct tttaggctct acttcttcaa acagcaataa ccaacaacaa 840

tacaacaaca cccttttaat gaacacctta caaggggaat taagcactaa caatcaaaat 900

aaccccaatg gttgcgccaa tcaaatccag tgtttagagc aattcatcca aaatttaacc 960

cctttagccg caacccccac ttcaactaac caggccaacc agcaagtcca agccatcgct 1020

caaaaacttc aaagcgttgc tatcaacgct ttagacaaca atgcgatcaa caacaccacc 1080

tataatttaa acaacttgca caacgctttg aatttccaag cctatcaaag cacgatagaa 1140

caatacaata acgctttaaa gcaaatttcg tggattagtt ttagcgagcc taaaaacttg 1200

ctcaaaaaca cttccaataa ctaccaaatc ggcacggtta ccaacgatca agggcaaaat 1260

atcagcgcct atgattgcac aagcgctacc ggaagccttt ctagcgatgc ttctagtggg 1320

atttcatgct cagccacaag ctccacaaat aacacaaata gttttgacaa ttctttagtc 1380

gctacctcca aagtccaaac catcaacggc aaagagcaga tcggcgtgaa ttcttttaat 1440

cttgtctctc aagtgtggag cgtttataac tctttaaaaa cttcagaaga aaatttgcaa 1500

aaaaacgcca aaatattatg caacaatgga tcgcaatctg ggacaagccc atgcaatagc 1560

tcttcagggg gtttgagcat cagcgggaac gcccaattgc aaaatatttt aagccctact 1620

aatgggacta ccactaatac tcaagctaaa agcaacgctt ccaaactaaa agcgatggta 1680

atggtgaata atgaagaaga agccaaaacg accaatttca atcaaagcag tgggccaacc 1740

acacaatctt ctaacagcac ggtgatggga gctttaaaca ccgtattgca aaatgtcagc 1800

aatttccaac aaagcattca aagcgctttt caaaaccaag aaaataatat ccaagcttgg 1860

gcgaacgcac tttataacac tagtaaccct aatgggaatc aatcgcaaaa tttaaccact 1920

aacaafcaacc aagatttacg catccaatta agggcgaatt tttaccagct catcaatacc 1980

attaaccagc aagtgcctac agacatgaac gctttaatta atcaaagcca acaaacccag 2040

caaacaagcg gafccagcaag caccacgaac aacgcatgcg cgagcggaat ggggagtagt 2100

ggcaactggt gcfcaccagca gtggtccgat tctaaggctt attacagcgg gttgcaaagc 2160

gctttagggt atcaaacaca agcgacaact caaaatggga gcagtggtgg gagcaatatc 2220

acctacaatg tccaacaaat cacgctcact agcggtggtt tgctcaatca aattatcaca 2280

aaccttaaga gcgttaatgg gggcagtaat gggggaagca gtgggaatgg cactagtcaa 2340

atcaacacag cctaccaaat gctcacagac gctagcgatg ggaaattagg gacttataat 2400

agtagcaata gtagcaatag tagcaatagt ggcaataata acggctatac gccatgcaat 2460

agcaccaacg ggagcaatgg gacgagtggg agcaattgtt atgaacccaa caaacaacaa 2520

aacgccacca ccgcaaccac cacgaccgac agcaatttac aaaaagtcta taatgacgcc 2580

caaaaaatag ccaatattat cgccagctct gggaacaata aaggcgttga aaacggctta 2640

aaacaattct ttgaagcgtt aaaaagtaat agcagcagtc ttagtaattt atgtggtaat 2700

ggtagtagcg gtagtagctc tacttgctcc ggtgggctta tcaacctttt aggggcaatc 2760

cccacaaacg gagtgagcga tacgaataat ttaattaatc tgctcactga attcattaaa 2820

accgccgggt ttatccaaaa taaggatagt aatgtatcta ctagtcttac aagcgctttt 2880

caagccatta cgagcgctat ttctcaaggg tttcaagcct tgcaaaacga tattagccct 2940

aatgcgattt tgaccttgct ccaagaaatc acttctaaca ccaccaccat tcagtcattc 3000

tcgcaaacct tacggcagct tttaggggat aaaaccttct ttatggtgca acaaaagctc 3060

attgatgcga tgattaacgc cagaaatcag gttcaaaacg cgcaaaatca agccaataac 3120

tacggctctc aacccgtttt aagccagtat gcggccgcta aaagcaccca acacggcatg 3180

agcaatggct taggggttgg cataggctat aaatacttct ttggtaaggc taggaaatta 3240

ggccttaggc attatttttt ctttgattac ggctttagtg aaataggcct agccaatcaa 3300

agcgtgaaag cgaatatctt tgcttatggg gtaggcacgg attttttatg gaatctattc 3360

aggaggactt acaacactaa agcgttgaat tttgggctat ttgccggggt ccaactgggc 3420

ggtgcaactt ggcttagttc cttaaggcaa caaatcattg acaactgggg gaacgctaat 3480

gacatccatt caacgaattt tcaagtggcg ctgaattttg gggtgcgcac caatttcgcg 3540

gagtttaagc gttttgctaa gaaattccac aatcaagggg tcatcagcca aaagagcgtg 3600

gaatttggga tcaaggtgcc tctcatcaat caagcgtatt tgaatagtgc tggggctgat 3660

gtgagctaca ggaggcttta tactttctat atcaattaca tcatggggtt ttaa 3714

<212> Type : DNA

<211> Length : 3714

SequenceName : SEQ ID 479
SequenceDescription :

Sequence

<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

atgaaacaaa atttaaagcc attcaaaatg attaaggaaa atttaatgac acaatctcaa 60 

aaagtaagat tcttagcccc tttgagccta gcgttaagct tgagcttcaa tccagtgggc 120

gctgaagaag atgggggctt tatgaccttt gggtatgaat taggtcaggt ggtccagcaa 180

gtgaaaaacc cgggtaaaat caaagccgaa gaattagcgg gcctgttaaa ctctaccacg 240 

acaaacaaca ccaatatcaa tattgcaggc acaggaggga atgtcgccgg gactttgggc 300

aaccttttta tgaaccaatt gggcaatttg attgatttgt atcctacttt gaaaactaat 360

aatcttcacc aatgcggtag cactaatagc ggtaatggcg ctactgctgc cgctgctact 420

aacaatagcc cttgtttcca aggtaacctg gctctttata acgaaatggt tgactctatc 480

aaaactttga gtcaaaacat cagcaagaac atctttcaag gcgacaacaa caccacgagc 540 

gctaatctct ccaaccagct cagtgagttg aacaccgcta gcgtttattt gacttacatg 600

aactcgttct taaacgccaa caaccaagcg ggtgggattt ttcaaaacaa caccaatcaa 660 

gcttacgaga atggtgttac cgctcaacaa atcgcttatg tcctaaagca agcttcaatc 720 

actatggggc caagcggtga tagtggggct gcgggagcgt ttttagacgc cgctttagcc 780

caacatgttt tcaactcggc taacgctggg aacgatttga gcgctaagga attcactagc 840

ttggtgcaaa acatcgtcaa taattctcaa aacgctttaa cgctagccaa caacgctaac 900 
atcagcaatt caacaggcta tcaagtgagc tatggtggga atattgatca agcgcgctct 960 

acccaactgt taaacaacac cacaaacact ttggctaaag ttaccgctct aaacaacgag 1020

cttaaagcta acccatggct tgggaatttc gctgctggta acagctctca agtgaatgcg 1080

tttaacgggt ttatcactaa aatcggttat aagcaattct tcggggaaaa caagaatgtg 1140 

ggcttacgct actacgggtt cttcagctat aacggcgcgg gcgtgggtaa tggccccact 1200

tacaatcaag tcaatctgct cacttatggg gtggggactg atgtgcttta caatgtgttt 1260 

agccgctctt ttggcagtag gagtcttaat gcgggcttct ttggggggat ccaactcgca 1320 

ggggacactt acatcagcac gctaagaaac agccctcagc ttgcgagcag acctacagcg 1380

acaaaattcc aattcttgtt tgatgtgggc ttacgcatga actttggtat cttgaaaaaa 1440 

gacctaaaaa gccataacca gcattctata gaaatcggtg tgcaaatccc tacgatttac 1500 

aacacttact ataaagctgg tggcgctgaa gtgaaatact tccgccctta tagcgtgtat 1560 

tgggtctatg gctacgcctt ctaa 1584 

<212> Type : DNA

<211> Length : 1584

SequenceName : SEQ ID 480
SequenceDescription :

Sequence



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString : 

atgaaaaaaa cccttttact ctctctctct ctctcgtttg ggctccacgc tgaagacgac 60 

ggcttttacg caagcgcggg aattcggatc ggtgaagccg ctcaaatggt gaaaaacacc 120

aaaggcattc aacagctttc agagaattat gaaaagttga acaatctttt aaataattac 180

aacaccctaa acactcttgt aaagctgtcc tccgatccga gtgctgtcaa cgacgcaagg 240 

gataatctag gctcaagcac taggaatttg ctagatgtca aagccaattc ccccgcctat 300

caagcggtgc ttttagcatt gaacgctgca gtgggcttgt ggcaagttac aagctatgcc 360 

tttaccgctt gtggtcctgg tagcaatgag aacgcgaatg gaggtatcca aacctttaat 420

aatgtgccag gacaaaacac gacgaccatc acttgtaatt cgtattatga gccaggacat 480

ggcgggccaa tatccactaa aaattatgcg atcatcaaca aggcttatca aatcattcaa 540 

aaggctttga cagccaatgg agaagggatc ccagttttaa gcaacaccac tacaaaactt 600 

gatttcacta tcaatggaga caaaagaacg ggtggcgaac caaataaaaa attagtatac 660 

ccatggagtc atgggaaagc tatttcaacc tcgtggaatg caaccataac agcaccaaca 720

acagaaaata tcaatacaac caatagcgct caagagcttt taaaacaagc gagcatcatt 780 

atcactaccc tgaatagtgc atgcccaaac ttccaaaatg gtggtagcgg ttattgggca 840 

gggataagtg gcaatgggac aatgtgtggg atgtttaaga atgaaatcag cgctatccaa 900 

ggcatgatcg ctaacgcgca agaagctgtc gcgcaagcca aaatcgttag tgaaaacacg 960 

caaaatcaaa acagcctaga cgctggaaaa ccattcaacc cctacacaga cgctagtttt 1020

gctgaaagca tgctcaaaaa cgcgcaagcc caagcggaga ttttaaacca agccgaacaa 1080

gtggtgaaaa actttgaaaa aatccctaca gcctttgtaa atgactcttt aggggtgtgt 1140

tatgaagtgc aaggaggtga gcgtcgtggc accaatccgg gtcagacgac ttctaacact 1200

tggggggcag gctgtgcgta tgtaggacaa acgataacaa atcttaaaaa cagcatcgcc 1260

cattttggca ctcaagagca gcagatacag caagccgaaa acatcgctga cactctggtg 1320

aatttcaaat ctagatacag cgaattgggc aacacttata acagcatcac cactgcgctc 1380

tctaatatcc ctaacgcgca aagcttgcaa aatgcggtga gtaaaaagaa taacccctat 1440 

agcccgcaag gcatagacac caattactac ctcaatcaaa actcttacaa ccaaatccaa 1500 

accatcaacc aagaactcgg gcgtaacccc tttaggaaag tggggattgt tagttctcaa 1560 

accaataatg gcgcgatgaa tgggatcggt attcaggtgg gttacaaaca attctttggc 1620

caaaaaagaa aatggggcgc taggtattac ggcttttttg attacaacca tgcgttcatt 1680

aaatccagct tcttcaactc ggcttctgac gtgtggactt atggctttgg agcggacgct 1740 
ctttataatt tcatcaacga taaagccacc aatttcttag gcaaaaacaa caagctttct 1800
gtggggcttt ttgggggtat tgcattagcc gggacttcat ggcttaattc tgagtatgtg 1860 
aatttagcca ccatgaataa cgtctataac gctaaaatga acgtggcgaa tttccaattc 1920 
ttattcaaca tgggagtgag gatgaattta gccaggccta agaaaaaaga cagcgatcat 1980 
gcggctcagc atgggattga gttagggctt aaaatcccca ccatcaacac gaactactac 2040
tcctttatgg gggctgaact caaataccga aggctctata gcgtgtattt gaattacgtg 2100 
ttcgcttact aa 2112 
<212> Type : DNA 
<211> Length : 2112 
SequenceName : SEQ ID 481

SequenceDescription : 

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

atgataaaga agaatagaac gctgtttctt agtctagccc tttgcgctag cataagttat 60 

gccgaagatg atggagggtt tttcaccgtc ggttatcagc ttgggcaggt catgcaagat 120 

gtccaaaacc caggcggcgc taaaagcgac gaactcgcca gagagcttaa cgctgatgta 180

acgaacaaca ttttaaacaa caacaccgga ggcaatgtcg caggggcgtt gagtaacgct 240

ttctcccaat acctttattc gcttttaggg gcgtatccca cgaaactcaa tggtaacgac 300

gtgtctgcga acgctctttt aagtggtgcg gtaggcagtg ggacttgcgc ggctgcaggg 360 

acggctggtg gcacaactct taacactcaa agcgcttgca ccgctgcggg ctattactgg 420

ctccctagct tgactgatag gattttaagc acgatcggca gccagactaa ctacggcacg 480 

aacaccaatt tccccaacat gcaacaacag ctcacctact tgaatgcggg gaatgtgttt 540

tttaatgcga tgaataaggc tttagagaag aatgggactg ctactgctaa tagcactagt 600

agcactagcg gtgcgactgg ttcagatggt caaacttact ctcaacaagc tattcaatac 660 

cttcaaggcc aacaaaatat cttaaataac gcagcgaact tgctcaagca agatgaattg 720 

ctcctagaag ctttcaactc tgccgtagct gctaacattg ggaataagga attcaattca 780

gccgctttta caggtttggt gcaaggcatt attgatcaat ctcaattggt ttataacgag 840

ctcactaaaa acaccattag cgggagcgcg gttaataacg ctgggataaa ctccaaccaa 900 

gctaacgctg tgcaagggcg tgctagtcag ctccctaacg ctctttataa cgtgcaagta 960 

actttggata aaatcaacgc gctcaacaat caggtgagaa gcatgcctta cttgccccaa 1020 

ttcagagccg ggaacagccg tgcaacgaat attttaaacg ggttttacac taaagtgggc 1080

tataagcaat fccttcgggaa gaaaaggaat atcggtttgc gctattatgg tttcttttct 1140

tataacggag cgagcgtggg ctfctagatcc actcaaaata atgtagggtt atacacttat 1200 

ggggtgggga ctgatgtgtt gtataacatc tttagccgct cctatcaaaa ccgctctgtg 1260 

gatatgggct tttttagcgg tatccaatta gccggtgaga ccttccaatc cacgctcaga 1320 

gatgacccca atgtgaaatt gcatgggaaa atcaataaca cgcacttcca gttcctcttt 1380

gacttcggta tgagaatgaa cttcggtaag ttggacggga aatccaaccg ccacaaccag 1440

cacacggtgg aatttggcgt agtagtgcct acgatttata acacttatta caaatcagca 1500 

gggactaccg tgaagtattt ccgtccttat agcgtttatt ggtcttatgg gtattcattc 1560 

taa 1563 
<212> Type : DNA 

<211> Length : 1563

SequenceName : SEQ ID 482 
SequenceDescription : 



Sequence 


<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaagaaaa aatttctgtc attaacctta ggttcgcttt tagtttccgc tttaagcgct 60 

gaagacaacg gcttttttgt gagcgccggc tatcaaatcg gtgaatccgc tcaaatggtg 120 

aaaaacacca aaggcattca agatctttca gacagctatg aaagattgaa caacctttta 180

acgaattata gcgtcctaaa cgctctcatc aggcagtccg ccgaccccaa cgccatcaat 240 

aacgcaaggg gcaatttgaa cgcgagcgcg aagaatttga tcaatgataa aaagaattcc 300

ccggcgtatc aagccgtgct tttagccttg aatgcggcag cggggttgtg gcaagtcatg 360

agctatgcga tcagcccttg tggtcccggt aaagacacaa gcaaaaatgg gggcgttcaa 420

actttccaca acacgccttc aaatcaatgg ggaggcacta ccattacttg tggcactact 480

ggttatgaac caggaccata cagcatttta tccactgaaa attacgcgaa aatcaataaa 540 

gcttatcaaa tcatccaaaa ggcttttggg agcagcggaa aagatattcc tgccttaagc 600 

gacaccaaca cagaactcaa attcacaatc aataaaaata atggaaacac gaatacgaat 660 

aataatggag aagaaattgt tacaaaaaat aacgctcaag ttcttttaga acaggctagc 720 

accattataa ctacccttaa tagcgcatgc ccatggatca acaatggtgg tgcaggtggt 780

gcgagtagtg gtagtttatg ggaaggaata tatttgaaag gcgatgggag cgcttgcggg 840 

atttttaaaa atgaaatcag cgcgattcaa gacatgatca aaaacgctgc aatagccgta 900 
gagcaatcca agatcgttgc tgcaaacgcg caaaaccagc gcaacctaga caccgggaag 960

acattcaacc cctataaaga cgccaacttc gcccaaagca tgttcgctaa cgccaaagcg 1020 

caagcggaga ttttaaaccg cgcccaagca gtggtgaaag actttgaaag aatccctgca 1080

gagttcgtaa aagactcttt aggggtgtgc catgaagtgc aaaacggcca tctccgtggc 1140

acgccatccg gcacggtaac tgataacact tggggagccg gttgcgcgta tgtgggagag 1200

accgtaacga atctaaaaga cagcatcgct cattttggcg accaagccga gcgaatccat 1260 

aacgcgcgca acctcgccta cactttagcg aacttcagca gtcagtatca aaaactaggc 1320

gaacactatg acagcatcac agcggccatt tcaagcttgc ctgatgcgca atctttacaa 1380

aatgtggtga gcaaaaagac taaccctaat agcccacaag gcatacagga taactactat 1440

attgactcca atatccattc tcaagtgcaa tctaggagtc aagaactcgg cagtaaccct 1500

ttcaggcgtg ctggcttaat cgccgcttct accaccaata acggcgcgat gaacgggata 1560 

ggctfctcaag tgggctataa gcaattcttt gggaaaaaca aacgatgggg cgcaaggtat 1620 

tacggctttg tggattacaa ccacacctat aacaaatccc aatttttcaa cgcctctfcct 1680 

gatgtctgga cctatggcgt ggggagcgat ttgttagtga atttcatcaa cgataaagcc 1740 

actaaacaca ataagatttc ttttggcgcg tttggcggta tcgccttagc cgggacttca 1800

tggcttaatt ctcagtatgt gaatttagcg aacgtgaata attattataa ggccaaaatc 1860

aacacggcga atttccaatt cttattcaat ctgggtttga gaatgaacct cgctaggaaa 1920

aagcatagag cgaccgataa cgcggcccaa catggcattg aactaggcac aaagatcccc 1980 

acgatcaaca cgaattacta ttctttgcta ggcactacct tgcaatacag aaggctttat 2040 

agcgtgtatc tcaactatgt gttcgcttac taa 2073
<212> Type : DNA
<211> Length : 2073

SequenceName : SEQ ID 483
SequenceDescription :




Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

atgaaaatca aaaaatccct ctttgctctc tctttctctc tcatggcttc attatcaagg 60

gctgaagatg acggatttta catgagtgtg ggctatcaaa tcggtgaagc ggtccaaaaa 120

gtgaaaaaca ctggagcatt acaaaatctt gcagacagat acgataactt gagcaacctt 180

ttaaaccaat acaattactt aaattcctta gtcaatctag ccagcacgcc tagcgcgatt 240

accggtgcga ttgacaatct aagctcaagc gcgatcaatc tcactagcgc taccaccact 300

tctccggcct atcaagctgt ggctttagcg ctcaatgcgg ctgtgggcat gtggcaagtc 360

atagcctttg gcatcagctg tggccctggc cccaatcttg gcccagaaca tttagaaaat 420

gggggcgttc gatcgtttga caacacgcca aactacagct acaacaccgg tagcggaacg 480

accaccacca cttgtaatgg agccagtaat gtagggccca atggtatcct atctagcagc 540

gaataccagg ttctcaatac cgcttatcaa actatccaaa ccgctttaaa ccaaaaccaa 600

ggaggcggga tgcctgcctt gaatagctcc aaaaatatgg tagtcaatat caatcaaact 660

ttcacaaaaa accctaccac agaatacact taccccgatg ggaatggcaa ttattattca 720

ggcggttcat caatcccaat ccagctaaag attagtagcg tcaatgacgc tgaaaacctt 780

ttgcaacaag ccgctactat catcaatgtc cttaccaccc aaaacccgca tgtgaatggt 840

ggcggtgggg catgggggtt tggcggtaag accgggaatg tgatggatat ttttggcgat 900

agttttaacg ctattaacga aatgatcaaa aacgctcaag ccgttttaga aaaaacccaa 960

cagcttaacg ctaatgaaaa cacccaaatc acgcaaccag acaatttcaa cccctacact 1020

tctaaagaca cgcagttcgc tcaagaaatg ctcaatagag ctaacgctca agcagagatt 1080

ttgagcttag cccaacaagt agcggacaat ttccacagca ttcaagggcc tatccaacaa 1140

gatctagaag aatgcaccgc aggatcagct ggtgtgatta acgacaacac ttatggttca 1200

ggttgcgcgt ttgtgaaaga gactctcaat tccttagagc aacacaccgc ttattatggc 1260

aaccaggtca atcaggatag ggctttgtct caaaccattt tgaattttaa agaagccctt 1320

agcactttag ggaacgactc aaaagcgatc aatagcggta tctctaactt gcctaacgct 1380

aagtcccttc aaaacatgac gcatgccact caaaacccta attccccaga aggtttgctc 1440

acttattctt tggataccag caaatacaac cagctccaaa ctgttgcgca agaattaggc 1500

aaaaacccct ttaggcgcat cggcgtgatt aactatcaaa acaataacgg ggcgatgaac 1560

ggcatcggcg tgcaagcggg ctataagcaa ttctttggca aaaaaaggaa ttgggggtta 1620

aggtattatg gtttctttga ttataaccat gcttatatca aatctaattt ttttaactcg 1680

gcttctgatg tgtggactta tggggtgggt atggacgcgc tttataactt catcaacgat 1740

aaaaacacca actttttagg caaaaataac aagctttctg tggggctttt tggtggcttt 1800

gcgttagccg ggacttcgtg gcttaattcc caacaagtga atttgaccat gatgaatggc 1860

atttataacg ctaatgtcag cgcttctaac ttccaatttt tgtttgattt aggcttgaga 1920

atgaacctcg ctaggcccaa gaaaaaagac agcgatcatg ccgctcagca tggcatggaa 1980

ttgggcgtga aaatccccac cattaacacg gattattatt ctttcatggg ggctgaactc 2040

aaatacagaa ggctctatag cgtgtatctc aattatgtgt ttgcttacta a 2091
<212> Type : DNA
<211> Length : 2091
SequenceName : SEQ ID 484
SequenceDescription :

Sequence


<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString : 

atgaaaatca aaaaatccct ctttgctctc tctttctctc tcatggcttc attatcaagg 60 

gctgaagatg acggatttta catgagtgtg ggctatcaaa tcggtgaagc ggtccaaaaa 120

gtgaaaaaca ctggagcatt acaaaatctt gcagacagat acgataactt gagcaacctt 180

ttaaaccaat acaattactt aaattcctta gtcaatctag ccagcacgcc tagcgcgatt 240 

accggtgcga ttgacaatct aagctcaagc gcgatcaatc tcactagcgc taccaccact 300

tctccggcct atcaagctgt ggctttagcg ctcaatgcgg ctgtgggcat gtggcaagtc 360

atagcctttg gcatcagctg tggccctggc cccaatcttg gcccagaaca tttagaaaat 420

gggggcgttc gatcgtttga caacacgcca aactacagct acaacaccgg tagcggaacg 480

accaccacca cttgtaatgg agccagtaat gtagggccca atggtatcct atctagcagc 540 

gaataccagg ttctcaatac cgcttatcaa actatccaaa ccgctttaaa ccaaaaccaa 600 

ggaggcggga tgcctgcctt gaatagctcc aaaaatatgg tagtcaatat caatcaaact 660

ttcacaaaaa accctaccac agaatacact taccccgatg ggaatggcaa ttattattca 720 

ggcggttcat caatcccaat ccagctaaag attagtagcg tcaatgacgc tgaaaacctt 780

ttgcaacaag ccgctactat catcaatgtc cttaccaccc aaaacccgca tgtgaatggt 840 

ggcggtgggg catgggggtt tggcggtaag accgggaatg tgatggatat ttttggcgat 900

agttttaacg ctattaacga aatgatcaaa aacgctcaag ccgttttaga aaaaacccaa 960

cagcttaacg ctaatgaaaa cacccaaatc acgcaaccag acaatttcaa cccctacact 1020 

tctaaagaca cgcagttcgc tcaagaaatg ctcaatagag ctaacgctca agcagagatt 1080

ttgagcttag cccaacaagt agcggacaat ttccacagca ttcaagggcc tatccaacaa 1140 

gatctagaag aatgcaccgc aggatcagct ggtgtgatta acgacaacac ttatggttca 1200

ggttgcgcgt ttgtgaaaga gactctcaat tccttagagc aacacaccgc ttattatggc 1260 

aaccaggtca atcaggatag ggctttgtct caaaccattt tgaattttaa agaagccctt 1320 

agcactttag ggaacgactc aaaagcgatc aatagcggta tctctaactt gcctaacgct 1380

aagtcccttc aaaacatgac gcatgccact caaaacccta attccccaga aggtttgctc 1440 

acttattctt tggataccag caaatacaac cagctccaaa ctgttgcgca agaattaggc 1500 

aaaaacccct ttaggcgcat cggcgtgatt aactatcaaa acaataacgg ggcgatgaac 1560 

ggcatcggcg tgcaagcggg ctataagcaa ttctttggca aaaaaaggaa ttgggggtta 1620 

aggtattatg gtttctttga ttataaccat gcttatatca aatctaattt ttttaactcg 1680

gcttctgatg tgtggactta tggggtgggt atggacgcgc tttataactt catcaacgat 1740

aaaaacacca actttttagg caaaaataac aagctttctg tggggctttt tggtggcttt 1800 

gcgttagccg ggacttcgtg gcttaattcc caacaagtga atttgaccat gatgaatggc 1860 

atttataacg ctaatgtcag cgcttctaac ttccaatttt tgtttgattt aggcttgaga 1920 

atgaacctcg ctaggcccaa gaaaaaagac agcgatcatg ccgctcagca tggcatggaa 1980

ttgggcgtga aaatccccac cattaacacg gattattatt ctttcatggg ggctgaactc 2040 

aaatacagaa ggctctatag cgtgtatctc aattatgtgt ttgcttacta g 2091 

<212> Type : DNA 
<211> Length : 2091

SequenceName : SEQ ID 485 
SequenceDescription : 






Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

ttgcataaaa aagttctgtt ggctttaact gccagcttga tttgccaaga gtctttgttc 60 

gctaaggata aagactacac tttgggcaag gtttctactg ccggtaaaaa ggatagatct 120 

gactattctg ggcaggtcaa tttgggttat agcgggatta ccgcgcctaa gagttggcaa 180

gatgaagaag tgaaaaaata cacaggaagc cgcacggtga tctctaacaa agcgctcacc 240 

caacaagcta accaaagcat tgaagaagct ttacagaatg tccccggtct gcaaattagg 300

aatgccacag gtgtgggggc tatgcctact atccaaatcc gtggctttgg agcggggggt 360 

tcagggcata gcgatgcgac gctcatgtta gttaatggta ttcctgttta tatggcccct 420 

tacgctcaca ttgagctaga cattttccct gttacctttc aagccattga tcgcattgat 480

gtgatcaaag gtggaggcag cgtgcaatat gggcctaaca cttatggggg tattgtcaat 540 

atcatcacta aacctatccc taatcaatgg gaaaaccaag cggctgaaag gatcacttat 600 

tgggctaagg ctagaaacgc tgggtttgcc gctccccctg ataaaaccgg cgatccttct 660 

ttcatcaagt ctttaggcaa caacctcctc tataacactt atgtgaggag cggagggatg 720 

atcaataagc atgtgggtat ccaagcgcaa gctaactggg ttagaggcca aggctttagg 780

gacaatagcc cctctagtat ttcaaactat tggctggatg gggtctatga catcaatgaa 840 

agcaatggga ttaaagccta ttaccaatac tacgattttg ctatcgccca accgggatca 900 
ctcagcgagc aagattacaa aataaaccgc ttcgctaatt tgcgcccctt aaaccaaaaa 960

ggcgggcgct cacaacgctt tggggctgtg tatgaaaacc gcttcgggga tttagacaga 1020 

gtgggcggga ctttcagctt cacttactac gggcagttga tgactaggga ctttcaggtg 1080

agctctagct acaatagcgc taacatggtt acttgtttta gcgaagcggc atgcagggcg 1140 

gcagggcttc cggcagggta taacttggct gtgccttatt atgccactaa ctacaatggt 1200

tsggcggagg tagaaaaccc tgtgcgttcc attaacaacg cttttgagcc taaagtgaat 1260

ctgatcgtca ataccgggaa agtcaggcaa acctttatca tgggtttgcg tttcatgacc 1320

accacttttt tacaacgcca atacttaaac accaatgaat gcgccactaa aacgagcggt 1380

gagggggcag gcttcttgtg tgagggccct aacgtgatga gcggttggaa accccacatc 1440 

aagcatggcg tttatagaaa ctggaataac tggcgcaaca attacacagc ggtctatttg 1500

agcgatcgca ttgaagcttg ggacgggcgc tttttcatcg tgcctggttt gcgctacgct 1560

tttgtgcaat acaacaacga aaatgcgtct aactggatgc aaatccctga gaaggattta 1620

agaaaaatca agcacatgaa caattggatg ccctcaacca acattggctt tatccctgtg 1680 

caaggpgatg acaatgtgct tacctacttc aactaccaac gctctttcgt cccgcctcaa 1740

ttagacgttt tgagctatgg aggagcggag tattttaccc aacactttga cacggtggaa 1800

gcaggagcgc gctacaccta taaagataaa ttcagcttca atgcggacta ctttaggatt 1860

tgggcgcgcg attttgccac cgggcagtat tcagtctata cgagcgggcc catgaaaggt 1920

aatgtgcgcc ccattaatgg ctattctcaa ggcgtggagc tggaattgta ttacaggccc 1980

attagagggt tgcaattcca tgccgctttc aactacattg acactcgtgt aactagccat 2040 

ggccctttaa ccgacttgaa cggggatgtg ctaaaaggga ctagctataa caagcatttc 2100

ccttttgtaa gccctttcca attcattttt gacgctcgct acaattggcg taaaaccacc 2160 

attggtattt ctagctattt ttatagccgt gcttatagcg ggattagcaa cagcgcagca 2220 

ggaggctatt atgggatgca atactatagt ggggggaaca actatgaaag cgttcttaat 2280

agcggttatc aatgcgaagc ttggtgtatg acccaacatg aagggctctt gccttggtat 2340

tgggtgtgga atatccaagt gagccaaatt ttctgggaaa acggaagaca cagagttaca 2400

ggaagcttac aaatcaataa catcttcaac atgaagtatt attttacagg gattggctct 2460

agccctgcag gcttgcaacc tgcgcctgga agatcggtta cagcgtattt gaactacact 2520 
ttctaa 2526

<212> Type : DNA 
<211> Length : 2526

SequenceName : SEQ ID 486 
SequenceDescription :






Sequence 






<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString : 

atgaaaaaaa cccttttact ctctctctcc gcttcatcgc ttttaaacgc tgaagacaac 60 

ggctttttta tcagcgcggg ctatcaaatc ggtgaagccg ctcaaatggt gaaaaacacc 120 

ggcgaattga aaaaactttc agacacttat gagaatttga gcaacctttt aaccaatttt 180

aacaacctca atcaggcggt aacgaacgcg agcagccctt cagaaatcaa tgctgcgatc 240 

gataatttaa aagcaaacac gcaagggcta attggcgaaa aaaccaattc cccggcgtat 300

caagcggtgt atttggcgct caatgcggcg gtagggctgt ggaatgtcat cgcctataat 360

gtccaatgcg gtcctggtaa cagtggacaa caaagcgtaa cctttgaggg ccaaccagga 420

cataattcaa gttccattaa ttgcaattta accggttata acaacggggt tagcggccct 480

ttatccattg agaattttaa aaagcttaat caggcttatc aaactatcca acaagcttta 540 

aaacaagata gcggatttcc tgttttggat agtgcaggaa aacaagtaac tataacaata 600 

acaacgcaaa ctaatggagc taataaaagt gaaactacta ctactactac tactactaat 660 

gacgctcaaa cccttttgca agaagccagt aaaatgataa gcgtcctcac tacaaactgc 720

ccatgggtca atcacaatca aggacaaaac gggggcgcgc cgtggggttt agatacggca 780

gggaatgtgt gtcaggtttt tgccacggaa tttagcgccg ttactagcat gatcaaaaac 840 

gcccaagaaa tcgtaacgca agctcaaagc cttaaccagc aaaacaatca aaacgcgccg 900

caagatttca atccttacac ctctgctgat agggctttcg ctcaaaacat gctcaatcac 960 

gcgcaagcgc aagccaagat acttgagcta gccgatcaaa tgaaaaaaga ccttaacact 1020 

atcccaagcc aatttatcac aaattacttg gcagcttgcc acaatggggg tgggacatta 1080

cctgatgcgg gggttactaa caacacttgg ggggccggtt gcgcgtatgt ggaagagacg 1140 

ataacggctt taaacaacag ccttgcgcat tttggcactc aagctgagca aatcaagcaa 1200

tctgagttgt tggcgcgcac catacttgat tttagaggca gccttagtaa tttaaacaac 1260

acttataaca gcatcaccac gaccgcttca aacacgccta attccccatt ccttaaaaat 1320 

ttgataagcc aatccactaa ccctaataac cccgggggct tacaggccgt ttatcaagtc 1380

aaccaaagcg cttattcgca attattaagc gccacgcaag aattagggca taaccctttc 1440

agacgcgttg gattaatcag ctctcaaacc aacaatggtg ccatgaatgg gatcggtgtg 1500

caagtgggct acaaacaatt ttttggtgaa aagagaaggt gggggttaag gtattacggc 1560 

ttttttgact ataaccatgc ttatatcaaa tctagctttt tcaattcggc ttctgatgtg 1620 

ttcacttatg gggtagggac agatgtcctc tataacttta tcaatgataa aaccaccaaa 1680

aacagcaaga tttcttttgg ggtgtttggg gggattgcgt tagctggcac ttcatggctg 1740 

aattcccagt atgtgaattt agcgaccttc aataatttct atagcgctaa aatgaatgtg 1800
gcgaatttcc aattcttgtt caatttaggc ttgagaatga acctcgctaa gaataagaaa 1860
aaagcgagcg atcatgcggc tcagcatggc gtggaattag gcgtgaagat ccccacgatc 1920
aacacgaatt actattcttt gctaggcact caactccaat accgaagatt gtatagcgtg 1980
tatttgaatt atgtgttcgc ttactaa 2007 
<212> Type : DNA

<211> Length : 2007 

SequenceName : SEQ ID 487 

SequenceDescription : 

Sequence



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

atgagaaaac tattcatccc acttttatta ttcagcgctt tagaagcgaa cgagaaaaac 60

ggctttttca tagaagccgg ctttgaaact gggctattag aaggcacaca aacgcaagaa 120

aaaagacaca ccaccacaaa aaacacttac gcaacttaca attatttacc cacagacacg 180 

attttaaaaa gagcggctaa tttattcacc aatgccgaag cgatttcaaa attaaaattc 240 

tcatctttat cccctgttag agtgttgtat atgtataatg gtcaattaac tatagaaaac 300

ttcttgcctt ataatttaaa taatgttaag cttagtttta cagacgctca aggcaacacg 360

attgatctag gcgtgataga gaccatcccc aaacactcta agattgtttt acccggggag 420

gcgtttgata gtttaaaaga ggcgtttgat aaaattgacc cctatacttt atttcttcca 480

aaatttgaag ccactagcac ttctatttct gatactaaca cgcagagggt gtttgaaacg 540 

ctcaataaca ttaaaacaaa tcttataatg aaatatagta atgaaaatcc aaacaatttc 600

aacacttgtc cttacaataa taatggtaat acaaaaaatg attgttggca aaatttcacc 660 

ccacaaaccg cagaagaatt caccaattta atgttgaaca tgatcgctgt cttagactcc 720

caatcttggg gcgatgcgat cttaaacgct ccttttgaat tcactaacag ctcaacagat 780

tgcgatagcg atccttcaaa atgcgtaaat cccggagtaa atgggcgtgt tgatactaaa 840 

gtcgatcaac aatatatact caacaaacaa ggtattatta ataattttag aaaaaaaata 900 

gaaattgatg cggttgtttt aaaaaattca ggggttgtag ggttagccaa tggatatggc 960

aatgatggtg aatatggcac attaggggta gaagcctatg ctttagatcc taaaaaactc 1020

tttggcaacg accttaagac tatcaattta gaagatttaa gaaccatctt gcatgaattc 1080 

agccacacta aaggctatgg gcataacggg aatatgacct atcaaagagt gccggtaacg 1140 

aaagatggtc aagtggaaaa ggatagtaat ggcaagccaa aagattctga tggcctcccc 1200

tataatgtgt gttcgcttta tgggggatcc aatcagcccg ctttccctag caactaccct 1260 

aattccatct atcacaattg tgcggatgtc ccggctggct ttttaggggt aacagcagcg 1320

gtttggcagc agctcatcaa tcaaaacgcc ttgccgatca actacgctaa cttggggagt 1380

caaacaaact acaacctaaa cgctagttta aacacgcaag atttagccaa ttccatgctc 1440 

agcaccatcc aaaaaacctt tgtaacttct agcgttacca accaccattt ttcaaacgca 1500

tcgcaaagtt ttagaagccc tattttaggg gttaacgcta aaataggcta tcaaaactac 1560 

tttaatgatt tcatagggtt ggcttattat ggcatcatca aatacaatta cgctaaagct 1620

gttaatcaaa aagtccagca attgagctat ggtgggggga tagatttgtt attggatttc 1680 

atcaccactt actccaataa aaatagccct acaggcattc aaaccaaaag gaatttttct 1740 

tcatcttttg gtatctttgg ggggttaagg ggcttgtata acagctatta tgtgttgaac 1800

aaagtcaaag gaagcggcaa tttagatgtg gctaccgggt tgaactaccg ctataagcat 1860 

tctaaatatt ctgtagggat tagcatccct ttaatccaaa gaaaagctag cgtcgtttct 1920

agcggtggcg attatacgaa ctcttttgtt ttcaatgaag gggctagcca ctttaaggtg 1980

tttttcaatt acgggtgggt gttttag 2007
<212> Type : DNA 
<211> Length : 2007 

SequenceName : SEQ ID 488

SequenceDescription : 



Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString : 

atgaataaaa caacaattaa aatattaatg ggcatggcgt tattatcatc gcttcaagcc 60 

gcagaggcag agcttgatga aaaatcaaaa aaacctaaat ttgcggatag gaatacgttt 120

tatttagggg ttgggtatca gcttagcgcg atcaacacgt cttttagcac cagttctata 180 

gataaatcgt atttcatgac cggcaatggt tttggcgtgg tgttgggggg gaaatttgtg 240

gctaaaacgc aagctgtaga gcatgtgggt tttcgttacg ggttgtttta tgatcagacc 300

ttttcttctc acaaatccta tatttctacc tatggtttag aatttagcgg tttgtgggac 360

gctttcaatt cgccaaagat gtttttgggg ttggagtttg gcttaggcat cgctggggcg 420 

acttacatgc caggaggggc catgcatggg attatcgctc aatatttagg caaagaaaat 480 

tcgcttttcc aattgcttgt gaaagtgggt tttcgttttg gctttttcca caatgaaatc 540

acctttgggt tgaaattccc tgtcattcct aacaaaaaaa cggaaatcgt tgatggcttg 600 

agcgcgacca ctttatggca acgcttgccg gtagcctatt tcaattatat ctataatttt 660 
tag 663
<212> Type : DNA
<211> Length : 663 

SequenceName : SEQ ID 489 
SequenceDescription :


Sequence 



<213> OrganismName : Helicobacter pylori, strain J99

<400> PreSequenceString :

atgaaaaaaa cgaaaaaaac gattctgctt tctctaactc ttgcggcgtc attgctccat 60 

gctgaagaca acggcgtttt tttaagcgtg ggctatcaaa tcggtgaagc ggttcaaaag 120 

gtgaaaaacg ccgacaaggt acaaaagctt tcagacgttt atgaacaatt aagcaagctt 180

ttagccaacg ataatggcac tagctcaaaa acaagcgcgc aagcgatcaa tcaagcggtt 240

aataatttga atgaaagcgc aaaaacttta gccggtggga caaccaattc ccctgcctat 300

caagccacgc ttttagcatt gagatcggcg ttagggttat ggaatagcat gggctatgcg 360

gtcgtatgcg gaggttatat taaaaaaccg ggcgaaaaca atcaaaaaaa tttccactac 420

accgatgaga atggcaacgg cactacaatc aattgcggtg ggagcacaaa tagtaatggc 480 

actcatagtc ctaatggcac aaatacatta aaagcagaca aaaatgtttc tctatctatt 540 

gagcaatatg aaaaaatcca tgaagcctat caaatccttt caaaggcttt aaaacaagct 600

gggcttgctc ctttaaatag caaaggggaa aagttagaag cgcatgtaac cacatcaaag 660 

gatcaacaag gaacatccag tgaccaaact acaaccacaa cttctgttat tgatacgact 720

aatgatgcgc aaaatctttt gactcaagcg caaacgattg tcaataccct taaagattat 780 

tgccccatgt tgatagcgaa atctagtagt aatggtggaa ctaatggcgc aaacacccct 840 

tcatggcaaa cagccggtgg cggcaaaaat tcatgtgcga cttttggtgc ggagtttagt 900

gctatttcag acatgattag taacgctcaa aaaatcgttc aagaaaccca acaacttaac 960

gccaaccaac ccaaaaatat cacccaaccc aataatttca accttaactc tcctggcagt 1020 

cttacggctt tagctcaaag catgctcaaa aacgctcaat ctcaaacaga aattttaaaa 1080

ttagccaatc aggtagcaag cgattttgac aaactttctt caggctatct taaagattac 1140 

atagggaaat gcgatgtgag tggtgtgagt agttcaaata tgacaccgca aaatatgaat 1200

accacttggg ggaaaggctg cgcgggcgtg gaagaaactc taacttcgtt aaaagcaagc 1260 

accactgatt ttaacaacca gacaacgccc caactcgatc aagcgcaaac cctagccaat 1320

acccttactc aagaactcgg caataaccct ttcaaacgag tgggtatcat tggctctcaa 1380

accaataacg gggcgatgaa tggccttggg gtgcaagcgg gttataagca attctttggt 1440

caaaaaagaa ggtgggggtt aaggtattac ggcttttttg actacaacca tacctacatc 1500

aaatccagct tttttaactc gtcttctgac gttttgactt atggggtggg tagcgattta 1560 

ttgtttaatt tcatcaatga taaaaacacc aatttcttag gcaagaacaa taagatttct 1620 

gtggggcttt ttggaggtat cgccttagca gggacttcgt ggcttaattc tcaattcgtg 1680 

aatttaaaaa ccatcagcaa tgtttatagc gctaaagtga atacggctaa tttccaattc 1740 

ttattcaatt taggcttgag aaccaatctc gctaggccta agaaaaaaga cagcgatcat 1800

tccgcgcaac atggcatgga attgggcgtg aaaatcccta ccattaacac gaattactat 1860 

tcttacttgg gaactaaact agaataccga agactctata gcgtgtatct caattatgtg 1920 

tttgcgtatt ga 1932 
<212> Type : DNA 

<211> Length : 1932

SequenceName : SEQ ID 490 
SequenceDescription : 






Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

atgaaaaaaa cgattttact ttctctcatg gtgtcatcgc tctttgctga aaatgacggc 60 

gtttatatga gcgtgggcta tcaaatcggc gaagccgcac aaatggtgaa aaacaccggc 120

gaaatccaaa aagtctccaa cgcttacgaa aatttgaaca accttttaac ccgctataat 180

gaactcaaac aaacggcctc taacactgat tcaagcaccg ctcaagcgat tgacaatcta 240 

gaaaagagcg ctagcagatt gaaaacgacc cctaataccg ccaatcaagc cgtgtcctca 300

gcgctcagct ctgcggtggg catgtggcaa gtgatagcct ctaatttagc caacaactcg 360

ctatcttcta gcgaatacga aaaactcaaa gcgacttctc aattgctcca aaatacccta 420

gaaaataaaa acaataatct taaaattgaa aatgactatg accagctttt aactcaagct 480

agtaccatta ttaataccct tcaaagccaa tgcccaggcg tagatggggg caatggcaaa 540 

ccatggggca ttaatacaag cgggaacgca tgcgctattt ttggtagcac ctttaacgcc 600 

attaatagca tgattgatag cgctaaaaaa gccgccgcag atgcccgaag aactgcccca 660 

gaaagtccaa accaacaaaa cgcgtttacc aacgctgatt tcaataaaaa cctcaatcaa 720 

gtctcaagcg ttatcaatga caccatctct tacctcaaag gggacaattt agaaaccatc 780

tacaacacca ttcaaaaaac gcctaattct aaagggtttc aaagtttggt gagccggtct 840 

agctatagtt attctctcaa cgaaacccaa tattctcaat tccaaactac caccaaagag 900 
tttggtcata accccttcag aagcgtggga ttaatcaact ctcaaagcaa taacggggcg 960

atgaatggcg tgggcgtgca actaggctat aagcaattct ttgggaaaaa taaatttttt 1020 

gggatccggt attatggctt ttttgattac aactatgcgt atatcaaatc caattttttc 1080 

aactccgctt ccaatgtttt cacctatggc gcgggcagtg atcttttatt gaacttcatc 1140

aatggcggat ccgatcgaaa ccgcaaagtc tcttttggca tttttggggg catcgctcta 1200

gcgggaacga catggcttaa taaccaatct gcgaatttaa aaatcaccaa tagcgcctac 1260

agcgctaaga tcaacaacac caatttccaa ttcttgttca ataccggttt aaggcttcaa 1320

gggatccatc atggcattga attaggcgtg aaaatcccta cgatcaacac caattactat 1380

tctttcatgg gcgctaaatt agcctaccgc aggctttata gcttgtacct caattatgtt 1440 

ttggcttatt ga 1452



<212> Type : DNA 
<211> Length : 1452 

SequenceName : SEQ ID 491 
SequenceDescription :


Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

atgccaaagg caagtcaggt tttattcttt ggagcgtttt taagcacttc tttacaaggt 60

tttgaagcta agctcaacgg ctttgtggat caatccagca cgatcggttt taaccagcat 120 

aaaatcaata aagaaagagg catctaccct atgcagcaat tcgcaacgat tgcgggctat 180 

ttagggcttg gttttagcct gttacccaaa aaggtttcag accatgttct aaaaggcaaa 240 

atagggggca tggtcggatc tattttctat gacggcacga agaagtttga agacggctct 300

gtggcttaca acctctttgg ttattacgat gggtttatgg gggtctatac taatatctta 360

caaaccgata gccttgagac acagaacatg aaacacaaca aaaatgtccg caattatgtc 420

tttagcgacg cgtatttaga atacgcttat aagaattatt ttgaaataaa agccgggcgc 480

tatctctcca ctatgcctta taaaagcggt caaacgcaag gctttcaagt ttctgggcaa 540 

tacaagcatg cgcgcttgac ttggtttagc tcatggggta gggcgtttgc ttatggttcg 600

tttttaatgg attggtttgc cgcacggacc acttatagcg gaggctttac caaaaacaat 660

aatggaggtt atgatagcca tgggcgaaag gtgctttatg gcacgcatgc ggtgcaactc 720

acctataaac ctcatcgttt cctcatagaa ggcttttatt acctttcgcc tcaaatcttt 780 

aacgctccag gcgttaagat tggttgggac tctaacccta attttagcgg cacaggcttt 840 

cgctctgata cggctatcat agggtttttc cccatttact acccttggat gatcgttaaa 900

tccaatggaa gcccggtcta tagatacgac acgcctgcca ctcaaaacgg gcaaaacctc 960

attatccgcc aacgctttga catcaacaat tacaatgttt caatcgcttt ttataaagtc 1020

tttcaaaacg ctaatggttg gataggcaac atggggaatc caagcggtgt gatcatgggg 1080

agtaacagcg tctatgcagg ttttacaggc acagccctta aaagagacgc cgctaccatt 1140 

ttcctttctt gtggtggcac tcattttgcc aaaaaattca catggaaatt cgccacacaa 1200

tactccaatt cagtggtctc ttgggaagca agagcgatga tctctttagg ctataaattc 1260

actgaatatt tgagcggtag cgtggatctt gcgtattatg gcgtgcatac taacaaaggc 1320

tttaaaccgg gtgaaaacgg gcctgtgcct aaaaacttcc ccgcccttta ttctgacagg 1380

agcgctttat acacggctct agtagcgtct ttttga 1416 
<212> Type : DNA 

<211> Length : 1416

SequenceName : SEQ ID 492 
SequenceDescription : 



Sequence 

<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgctaaggc tcgttagtaa aacgatttgt ttgtctttaa ttagcttgtt caacccttta 60 

gaagcctttc aaaaacacca aaaagacgtc ttttttgtag aagctgggtt tgaaaccggg 120 

ctattagaag gcgcgcaaac caaagaacaa gcaatagccc aaaacaccca aaacacccaa 180

aaaatttatg aaaaccccct aacccacccc caaactaaag aacaacctaa agaacaaaac 240 

aaaagcgata cagccacccc acaaagcgtc tatgggagat actacatcct ccaaaacacc 300

attttagaaa aagcgactga gctattcaca gcggctaata tcaatggcaa cggcttaact 360

ttttattctc aaaaccctgt gtatgtgatg gcatacaata aagataatgc cgagtttgaa 420 

ggctatggca ataacagcgt ggttgtgata caaaacttcc tgccctacaa tttaaacaat 480

attgagctga gttatacaga cgctcaaggc aaggcagtca atttaggcgt gatagagacc 540 

atccctaaag attctcaaat catcttgcct gcaagtttgt ttaataattt ttcaaacgat 600 

tcaccattca actctgatgg cctccaacaa ctccaaacca ctaccacccc cttttctgat 660 

gctaacacgc agagtttgtt tgaaaagctc agtcaaatca cgaccaatct tcaaatgact 720 

tatgagaata cagacccctt ttctagcggc aacaacgatc ctaatggccc tctcgcttct 780

cctaaacctc attatgaatg ccctggttat aaaaagagtt gtcaagtcgc ttcggtgtct 840 

ttcaccccac aaaccgcaga agaattgacc aatttaatgt tagacatgat tgcggtgttt 900
gactctaaat cttgggaaga agccgtttta aacgcccctt tccaattttc taacagccca 960

tcagagtgcg gcattgatta ccctaaatgc gttaatccct ttaataacgg gcttgttgat 1020 

cctaaagatg aaaaatacgc gctaacccca gaagaggtta tcaatagtta tagagtcgcc 1080

aatgaactta ccgtgaacct cttgaatgcg gccaaggggt ttctagggct aggatcccaa 1140

ctgggtagog ccaatgcccc cgatgatgat ggcttcaatc aaggtgtttt agggatagcg 1200 

ccttttgctt tagatcctga aaaattgttc ggtaaaaatt tgaataaagt ggctattttg 1260 

gcattaagag acattatcca tgaatatggg catactttag gctataccca taacgggaac 1320 

atgacttatc aaagggtgcg tttatgccaa gaaggaaacg ggccagaggc acgctgtgag 1380 

ggcgggcatg aagtggagaa aaacggcaaa gaagagctag aattcagtaa tgggcatgaa 1440

gtgcgagaoc atgatggtta cacctatgat gtttgctctc gttttggcgg caaaaatcag 1500

cccgctttcc ctagcaatta ccccaattcc atctatacca attgcgctca agtccccgct 1560 

gggcttatag gggttactac cgctgtttgg caacagctca tcaatcaaaa cgccctgccc 1620

attaatttcg ctaacctaaa tagccaaacc agccatttaa acgccgggtt gaatgcgcaa 1680 

aattttgcaa cctctatggt cagcgcgatc gcgcaaaatt tttccaccac ttecactacc 1740 

acttaccgct cttcaagtaa gaattttaga agccctattt taggggttaa tgttaaaata 1800

ggctaccaac attatttcaa tgactacatc gggttagcct attacggcat tatccaatac 1860

aactacgctc aagctaacga tgaaaaaatc cagcaattaa gctatggtgg gggaatggat 1920 

gtgctgtttg atttcatcac cacttacacc aataaaaagc aagaccatcc aactaaaaag 1980

gtttttgctt cctcttttgg ggtgtttggg gggttaaggg gcttatacaa tagctactat 2040 

gtcttcaatc aagtcaaagg aagcggtaat ttagatatag taaccgggtt taattaccgc 2100

tacaagcatt ctaaatattc cataggcgtt agcgttcctt taatccaaag cggtattaag 2160 

atcgcttcta ataatggtat ctatgcagac tctgttgttt tgaatgaagg gggtagccat 2220 

tttaaagtgt tttttaatta cgggtgggta ttttaa 2256
<212> Type : DNA
<211> Length : 2256

SequenceName : SEQ ID 493

SequenceDescription :

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

ttgcaaaact ttgtttttaa taaaaaatgg ctcatctatt ctagcctact ccccttattt 60
tttcttaacc ctttaatggc agaagacgat gggtttttta tgggggtgag ttatcaaact 120

tctttggcog ttcaaagggt ggataactca gggcttaacg ccagtcaaga cgcatccact 180 

tatatccgoc aaaacgctat cgctctagaa tctgcggcgg tgcctttagc ctattattta 240 

gaagcgatgg gccaacaaac gagagtctta atgcaaatgc tctgccctga tccttctaaa 300

agatgtttgc tctatgcagg gggctatcaa aacggacaaa ataataacgg cgatacaggc 360 

aacaaccooc caagaggcaa tgtcaatgcc acctttgata tgcaatctct agtcaataat 420 

ttaaacaact tcacccaact catcggcgaa actttaatcc gtaaccctga aaatcttcct 480

aactccaaag tctttaacgt caaatttggc aatcaaagca ctgttattgc attgcctgag 540 

ggtctagcca ataccatgga cgctttaaac aatgacatca ccaacgcttt aaccacgctc 600 

tggtataacc aaaccttaac gaataaatct tttagcaccc ctagtaacac ttctgtgaat 660 

tttagccccc aagtcttgca acacctttta caagacggct tagccacagc aaataataat 720 

caaaccattt gcagcactca aaaccaatgc accgccacta atgaagctaa atctatcgct 780 

caaaacgccc aaaacatctt ccaggcttta atgcaagcag ggattttagg gggcttagcc 840 

aatgaaaagc aatttggctt cacttacaac aaagccccca atggcagcga ttcccaacaa 900 

ggctatcaaa gctttagcgg cccgggttat tacaccaaaa acgacaacac cacgcaagcg 960 

cccttaaaag cattacccgc tggagcgaca attggatcag gcaatggcca atacacctac 1020 

caccccagct cggcagtcta ttatttagcc gatagcatca tcgctaatgg catcaccgct 1080

tctatgattt tttcaggcat gcaaaatttc gccaataaag ccgctaaact gataggcact 1140 

tcaagctata accagatgca agatgcgatc aactatgggg aaagcttgct tagtaacacc 1200 

gtagcgtatg gggatttcat caccaattgg gtcgccccct atttggattt aaacaataaa 1260

ggtttgaatt tcttgcctaa ttatgggggg caattgaatg gcgctaataa tcaaacccca 1320 

caattaaccc cacaacaagc ccaacaagaa caaaaagtga tcatgaacca attagagcaa 1380 

gccacaaacg cccccacccc cgcgcaaata aacaggattt tagccaaccc ctattccccc 1440

acggcaaaaa ctttaatggc ttatgggctc tatcgctcta aagcagtgat tggcggagtg 1500 

attgatgaaa tgcaaactaa agtgaatcaa gtctatcaaa tgggctttgc taggaatttt 1560

ttggagcata actctaattc taataacatg aacggctttg gcgtgaaaat gggctataag 1620

caatttttcg gcaaaaagcg catgtttggg cttaggtatt atggttttta tgattttggt 1680

tacgctcaat ttggcacaga atcttcttta gtgaaagcca ccctctctag ctatggagcg 1740

ggcacagact ttctttataa tgtttttacc cgaaaaagag ggactgaagc gatagatata 1800

ggtttttctg ccggtatcca acttgcaggg caaacctgga aaacgaattt tttagatcaa 1860

gtggatggca accatcttaa acctaaggac acttctttcc aattcctttt tgatttgggc 1920 

ataaggacca atttttccaa aatcgctcat caaaaaagat cccgtttttc tcaagggata 1980 

gaatttggcc ttaaaatacc ggtgctttat cacacctatt accaatcaga aggcgttaca 2040 

gcgaagtata gaagagactt tagtttttat gtgggctaca acataggctt ttga 2094 



Sequence 



<212> Type : DNA 
<211> Length : 2094 

SequenceName : SEQ ID 494 
SequenceDescription :

Sequence

<213> OrganismName : Helicobacter pylori, strain J99 

<400> PreSequenceString :

atgaaaaaga caattctact ctctctctct ctctctctcg cttcatcgct cttgcacgct 60 

gaagacaacg gcttttttgt gagcgcgggc tatcaaatcg gcgaagcggt gcaaatggtc 120

aaaaacaccg gtgaattgaa aaacttgaac gaaaaatacg agcaattaag ccagtattta 180 

aatcaagtgg cttcgttgaa gcaaagcatt caaaacgcca acaacattga gctggtcaat 240

agctctttaa actatttaaa aagctttacc aacaacaact acaacagcac cacccaatcg 300

cccatcttta acgccgtgca agccgttatc acttcggtat tgggtttttg gagtctttat 360

gcggggaact atctcacttt ttttgtggtt aataaggata ctcaaaaacc cgctagtgtc 420

cagggtaacc ctcctttttc aactattgtt caaaactgct caggaattga aaactgcgcg 480 

atgaatcaaa ccacttatga taagatgaaa aagctcgctg aagatctcca agcagcccaa 540 

caaaacgcta ccactaaagc gaacaatctt tgcgctttat ccggatgcgc cacaacacaa 600

ggccaaaacc caagctcaac cgtaagcaac gctcttaact tagcgcaaca gcttatggat 660 

ttgatcgcaa acactaagac ggctatgatg tggaaaaata tcgtcatcgc tggcgtttca 720

aacgtatccg gcgctatcga ttccactggc tacccaacgc aatacgcggt gtttaacaac 780

attaaggcga tgatacctat cttgcaacaa gcggttacgc tttctcaaag taaccacaca 840 

ttatctgcta gcttgcaagc tcaagctaca ggatctcaaa caaaccccaa attcgctaaa 900

gacatctacg ctttcgctca aaaccaaaag caagtcattt cttacgctca agacattttc 960 

aacctcttta gttctatccc taaagatcag tatcgttatt tagagaaagc ctatttgaaa 1020

atacccaatg cgggtaaaac gcctactaac ccttacagac aggaggtgaa tttaaaccaa 1080

gaaattcaaa cgatccaaaa caatgtgagt tattatggta atcgggtgga tgcggcttta 1140 

agcgtggcta aagatgttta taatttaaaa tccaatcaaa cagaaatcgt aaccacctat 1200

aacaacgcta agaatttgag ccaagagatt tctaaactcc cctataacca agtcaataca 1260

aaagacatta tcacactgcc ttacgatcaa aacgctccgg cagcgggcca atacaactac 1320

cagatcaacc cagagcagca atccaatctt tctcaagctt tagcggcgat gagcaataac 1380

ccctttaaaa aagtgggcat gatcagctct caaaacaata acggcgcttt gaacgggctt 1440

ggcgtgcaag tgggttataa acaattcttt ggcgaaagca aaagatgggg gttaaggtat 1500

tatggtttct ttgattacaa ccacggctat atcaaatcca gcttttttaa ttcttcttct 1560

gatatatgga cttatggcgg tgggagcgat ttgttagtga attttatcaa cgatagcatc 1620 

acaagaaaga acaacaagct ttctgtgggt ctttttggtg gtatccaact agcagggact 1680 

acatggctta attctcaata catgaattta acagcgttca ataaccctta cagcgcgaaa 1740 

gtoaatgctt ccaatttcca atttttgttc aatctcggct tgaggacgaa tctcgctaca 1800

gctaagaaaa aagacagcga acgttccgcg caacatggcg ttgaactggg cattaaaatc 1860 

cctaccatta acaccaatta ttattctttt ctaggcacta agctagaata cagaaggctt 1920 

tatagcgtgt atctcaatta tgtgtttgct tattaa 1956

<212> Type : DNA 

<211> Length : 1956

SequenceName : SEQ ID 495
SequenceDescription :



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString :

atgttaaaac tcgccagtaa aacgatttgt ttgtccctaa tcagctcatt cacggctgta 60 

gaagcctttc aaaaacacca aaaagacggc tttttcatag aagccggctt tgaaaccggg 120

ctattacaag gcacacaaac ccaagaacaa accatagcca ccactcaaga aaaacccaaa 180

cccaaaccca aaccaaaacc cattacccct caaagcacct atgggaaata ctacatctcc 240

caaagcacca ttttaaagaa tgcgactgag ttgtttgcag aggataatat caccaactta 300

accttttact ctcaaaaccc tgtgtatgta accgcttata accaagaaag cgctgaagaa 360

gctggctatg gtaataacag cttgattatg atacaaaact tcttgcctta taacttgaac 420

aacattgagc tgagttacac ggacgatcaa ggcaatgtgg tcagtttggg cgtgatagag 480

actatcccta aacaatctca aatcattctg cccgcaagct tgtttaacga cccacagctt 540

aacgccgatg gcttccaaca actccaaacc aacaccacac gattttctga tgccagcacg 600

cagaatctgt ttaacaagct cagcaaggtt acaaccaatc ttcaaatgac ttatatcaat 660 

tacaaccaat tttctagcgg taacggcagt ggctctaaac ccccatgccc cccatacgaa 720 

aaccaagcaa attgtgtggc taaagtgccg cctttcacct ctcaagacgc taaaaatttg 780

accaatttaa tgctgaacat gatggcggtg tttgattcta aatcttggga agacgccgtc 840 

ttaaacgctc ctttccaatt cagcgacaac aacctgtcag cgccatgtta ttctgattac 900
cttacatgcg tgaatcctta caacgatggg cttgttgatc ctaaattgat cgccaaaaat 960

aaaggagatg aatacaatat agaaaacggg caaacaggct cagtgatatt aacgccgcaa 1020 

gatgttatct atagctatag agtcgctaat aatatttatg tgaatctctt gcccacaaga 1080 

ggaggggatt tagggttagg gtctcaatat ggtggcccga atggcccagg cgatgatggc 1140 

accaattttg gcgctttagg gatattgtcc cctttcttag accctgaaat attgtttggc 1200 

aaagaattga ataaagtcgc catcatgcaa ttaagagaca tcatccatga atacggccat 1260 

actttaggct atacgcataa cgggaacatg acttatcaaa gagtgcgcat gtgcgaagaa 1320

aacaatgggc cagaagagcg ctgtcagggc ggaaggatag agcaagtgga tgggaaagaa 1380

gtgcaagtgt ttgacaacgg gcatgaagtg cgagacaccg atggctctac ctatgatgtg 1440 

tgttctcgtt ttaaagataa gccctataca gcgggcagct atcctaattc catctatacc 1500

gattgctctc aagtccccgc tgggcttata ggcgttacca gcgctgtttg gcaacaactc 1560 

attgatcaaa acgccctacc ggtggatttt actaatttga gcagccaaac caactatttg 1620

aacgccagct tgaacacgca agactttgcg accaccatgc ttagcgcgat cagtcaaagc 1680

ctttcatctt ctaaatctag cgccactact tatcgcactt eaaaaacctc acggcccttt 1740

ggagcccccc tattaggcgt taatcttaaa atgggctatc aaaaatattt taatgattat 1800

ctagggttgt cttcttatgg cattatcaaa tacaactacg ctcaagccaa caacgaaaaa 1860 

atccagcaat taagctatgg cgtgggaatg gatgtgctgt ttgatttcat caccaattac 1920

actaacgaaa agaaccccaa aagcaatcta accaagaaag ttttcacttc ctctcttggg 1980 

gtgtttgggg ggttaagggg cttatacaac agctattatt tgttgaacca atacaaaggg 2040

agcggtaatt taaatgtgac cggtgggttg aattaccgct acaagcattc caaatattct 2100 

ataggcatta gcgttccttt ggtccagttg aaatctagga tcgtttctag cgatggtgct 2160

tataccaatt ctatcaccct caatgaaggg ggcagtcatt ttaaagtgtt ttttaattac 2220

gggtggattt tctaa 2235
<212> Type : DNA 
<211> Length : 2235 

SequenceName : SEQ ID 496 

SequenceDescription : 



Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

atgagaaaac tattcatccc acttttatta ttcagcgctt tagaagcgaa cgagaaaaac 60 

ggctttttca tagaagccgg ctttgaaact gggctattag aaggcacaca aacgcaagaa 120

aaaagacaca ccaccacaaa aaacacttac gcaacttaca attatttacc cacagacacg 180 

attttaaaaa gagcggctaa tttattcacc aatgccgaag cgatttcaaa attaaaattc 240 

tcatctttat cccctgttag agtgttgtat atgtataatg gtcaattaac tatagaaaac 300

ttcttgcctt ataatttaaa taatgttaag cttagtttta cagacgctca aggcaatgtg 360

atcgatctag gcgtgataga gactatcccc aaacactcta agattgtttt gcccggagag 420

gcatttgata gtctaaaaat tgacccctat actttatttc ttccaaaaat tgaagccact 480 

agcacttcta tttctgacgc taacacgcag agggtgtttg aaacgctcaa taagattaag 540 

acaaatttgg tcgtaaatta taggaatgaa aacaaattta aagatcacga aaatcattgg 600 

gaagccttta ccccacaaac cgcagaagaa ttcactaatt taatgttgaa catgatcgct 660 

gttttagact cccaatcttg gggcgatgcg atcttaaacg ctccttttga gttcactaac 720

agcccaacag attgcgataa tgatccttca aaatgcgtaa atcctgggac aaacgggctt 780 

gtcaattcta aagtcgatca aaaatatgtg ttaaacaaac aagacattgt caataaattt 840

aaaaacaaag cggatcttga tgtaattgtt ttaaaggatt caggggttgt agggcttggg 900

agtgatatta cccctagcaa caatgatgat ggcaagcatt atggccagtt aggggtagta 960 

gcttctgctt tagatcctaa aaaactcttt ggcgataacc ttaagactat caatttagag 1020

gatttaagaa ccatcttgca tgaattcagc cacactaaag gctatgggca taacgggaat 1080 

atgacctatc aaagagtgcc ggtaacgaaa gatggtcaag tggaaaagga tagtaatggc 1140 

aagccaaaag attctgatgg cctcccctat aatgtgtgtt cgctttatgg gggatccaat 1200 

cagcccgctt tccctagcaa ctaccctaat tccatctatc acaattgtgc ggatgtcccg 1260 

gctggctttt taggggtaac agcagcggtt tggcagcagc tcatcaatca aaacgccttg 1320

ccgatcaact acgctaactt ggggagtcaa acaaactaca acctaaacgc tagtttaaac 1380

acgcaagatt tagccaattc catgctcagc accatccaaa aaacctttgt aacttctagc 1440 

gttaccaacc accatttttc aaacgcatcg caaagtttta gaagccctat tttaggggtt 1500

aacgctaaaa taggctatca aaactacttt aatgatttca tagggttggc ttattatggc 1560 

atcatcaaat acaattacgc taaagctgtt aatcaaaaag tccagcaatt gagctatggt 1620

ggggggatag atttgttatt ggatttcatc accacttact ccaataaaaa tagccctaca 1680 

ggcattcaaa ccaaaaggaa tttttcttca tcttttggta tctttggggg gttaaggggc 1740

ttgtataaca gctattatgt gttgaacaaa gtcaaaggaa gcggcaattt agatgtggct 1800

accgggttga actaccgcta taagcattct aaatattctg tagggattag catcccttta 1860 

atccaaagaa aagctagcgt cgtttctagc ggtggcgatt atacgaactc ttttgttttc 1920

aatgaagggg ctagccactt taaggtgttt ttcaattacg ggtgggtgtt ttag 1974 



<212> Type : DNA
<211> Length : 1974

SequenceName : SEQ ID 497
SequenceDescription :

Sequence



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgagtctag ctacgagfcta caatgtgagt aataattttt ctaagtttaa tattaagaga 60 

gtcagaggat atttgattitg tcttgtttgt aacacaccta aaatgataca aagaggattg 120

aatggtgtct cattttatgg ttgctctgat tatgtaaata aaggcgactg taagggcgtt 180 

ttacgagaaa taaatggctc aatgaaaatg gtctgcttac attgtgaaaa cacgcccata 240 

atggaaaaag tagaaagfcgg taggggagga gcttacgctt gtaagaattg caataggaag 300

ttttacttta tcgatcttgc aaaacaaaac gaacgaaaaa aagatttaga aaaagaaaaa 360

aaagaattgc ttaataagat tgaaaagcaa aaaatcaaac accttgagcg tttcattttg 420

gctggtgtaa aagctaatat taaagaaaat tcttttttct taggatgtaa aaattatcct 480 

aagtgcgaat ggactgcfcag tatggattca caagatctta aatgtcccaa atgcaacaga 540 

ttaatgaaaa gaaaaaagaa tttcaaaaac aatgagtttt ttacagctac atcgcttacc 600

ttaaatgcaa tagaattttg tctctatatt aatttgaaaa aaaaggaaac caatgtttag 660 




<212> Type : DNA 

<211> Length : 660 

SequenceName : SEQ ID 498 
SequenceDescription : 

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString :

ttggaaatta agaaatattt tctttacgct ctattttttt tgcttttttc tggtcttttt 60

ttatccaaac ttcaagctta taaattcaac atgagtattg ttggaaaggt gagcagctat 120 

actaagtttg gctttaacraa ccaaagatac cagccttcta aagacattta tcctacaggt 180 

agttatactt ctttactcgg cgaattgaat ttgagcatgg gattatacaa gggcttgagg 240 

gcagaagtag gggctatgjat ggcagcgctt ccctatgact ctaccgccta tcaaggcaat 300

aatatcccta atggccagcc cggatctagg acggatcctt ttggggcggg tatcttttgg 360

caatacattg gctggtatgc aggacatagc ggtttaaacg tgcaaaaacc tcgtttggct 420

atggtgcata acgctttfctt gagctacaac tacaagaaag acaaattcag ttttggcgtg 480 

aaaggggggc gctatgafcgc tgaagagtat gattggttca cttcttacac tcaaggggtt 540 

gaaggctttg tcaaatacaa agacaccagg ttaagggtga tgtattcaga cgctagggct 600 

tcagcgtcaa gcgactggtt ttggtatttt gggcgttact atacaagcgg taaggctcta 660

atgattgcgg atttgaaata cgaaaaagac aacctaaaaa tcaaccctta tttttatgcg 720 

atctttcaaa gaatgtatgc gccaggcatt aatatcactt acgacaccaa ccctaatttc 780 

aacaataagg gctttcgfctt tgtaggcact ttcgtggggt ttttccccat ttttgccact 840 

ccggctaatc aaaatgatrat tatcctattc caacaagtgc cattaggaaa gagcgggcaa 900 

acttatttct tccgcactcg tttttactat aacaagtggc aatttggggg tagcgtctat 960

aaaaatatcg gtaacgctaa tggcgatata ggtatttatg gcgaccctct agggtataac 1020 

atttggacga atagtattta tgacgcagaa attaacaata tcgttggcgc tgatgttatt 1080

aacgggtttt tatatgtagg ctcgcagtat agggggttta gttggaaaat tttaggccgt 1140 

tggacggata gcccaagggc tgatgaaagg agtctcgcgc tctttttgag ttatttttct 1200

aataagtata atattagaat ggatttgaaa ctagaatatt atggcaatat caccaaaaaa 1260

ggctattgta ttgggtafctg tggcatgtat gttccagttg atcctaatgg gcctggcacg 1320 

caacctttaa cacacaacgt gtattctgac aggagccata tcatgtttaa cattacttat 1380 

ggttttagga tttactag 1398
<212> Type : DNA

<211> Length : 1398

SequenceName : SEQ ID 499
SequenceDescription :



Sequence 

<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaaaaaga caattctact ctctctctct ctctctctcg cttcatcgct cttgcatgct 60 

gaagacaacg gcttttttgt gagcgcgggc tatcaaatcg gcgaagcggt gcaaatggtc 120

aaaaacaccg gcgaattcjaa aaacttgaac gacaaatacg agcagttaag ccaatcttta 180

gcccaactgg cttcgttaaa aaaaagcatt caaacggcga acaacattca ggctgtcaac 240 

aatgctttaa gcgatttaaa aagctttgcg agtaacaacc acacaaacaa agaaacatcg 300 
cccatctaca acaccgcgca agctgttatc acttcagtat tggctttttg gagtctttat 360

gcagggaacg ctctcagttt tcatgtgacc ggtttgaatg atggatctaa ttctccttta 420

ggaagaatcc atagagatgg gaactgcaca ggattacaac aatgttttat gagcaaagaa 480

acttatgata aaatgaagac acttgccgaa aacctccaaa aagctcaagg caatctctgt 540

gccttatcag aatgctctag caatcaatca aatggaggca aaacttccat gactacagct 600

cttcaaaccg cgcaacagct catggactta atcgaacaga ccaaggtttc tatggtgtgg 660

aaaaatatcg fccatcgcagg tgttacaaac aaacccaatg gtgctggcgc tatcacatcc 720

actggtcatg taaccgacta tgcggtgttt aacaacatca aggcgatgct acctatcttg 780

caacaagcgc fctacgctttc tcaaagtaac cacaccctat ccactcagtt gcaagctcga 840

gctatgggat ctcaaacaaa tcgtgaattc gctaaagaca tctacgcttt agctcaaaac 900

caaaagcaaa tcctttctaa cgcttcaagt atcttcaatc tctttaattc cattcctaaa 960

gaccaactta agtatttgga gaacgcttac ttgaaagtgc cacatttggg taaaacccct 1020

actaaccctt acagacagaa tgtgaatttg aataaagaaa ttaatgcggt tcaagacaat 1080

gtagctaatt atggtaatcg tttggattcg gctttaagcg tggctaaaga tgtttataac 1140

ctaaaatcca atcaaacaga gatcgtaacc acttataacg atgctaagaa tttgagcgaa 1200

gagatttcta aacttcccta taaccaagtc aatgtaacaa acatcgttat gtcgcctaaa 1260

gattctacag cgggccaata ccaaatcaac ccagagcagc aatccaatct taaccaagct 1320

ttagcggcga tgagcaataa cccctttaaa aaagtgggca tgatcagctc tcaaaacaat 1380

aacggcgctt fcgaacgggct tggcgtgcaa gtgggttata aacaattctt tggcgaaagc 1440

aaaagatggg cjgttaaggta ttatggtttc tttgattaca accacggcta tatcaaatcc 1500

agctttttta attcttcttc tgatatatgg acttatggcg gtgggagcga tttgttagtg 1560

aattttatca acgatagcat cacaagaaag aacaacaagc tttctgtggg tctttttggt 1620

ggtatccaac fcagcagggac tacatggctt aattctcaat acatgaattt aacagcgttc 1680

aataaccctt acagcgcgaa agfccaatgct tccaatttcc aatttttgtt caatctcggc 1740

ttgaggacga atctcgctac agctaagaaa aaagacagcg aacgttccgc gcaacatggc 1800

gttgaactgg gcattaaaat ccctaccatt aacaccaatt attattcttt tctaggcact 1860

aagctagaat accgaaggct ttatagcgtg tatctcaatt atgtgtttgc ttattaa 1917



<212> Type : DNA
<211> Length : 1917

SequenceName : SEQ ID 500 
SequenceDescription : 






Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

atgaaattaa aaaaacgaaa agttgcggct acattgctaa agcgtttgac cttgccacta 60 

ttgttcacta cgggttcatt aggggcggtt acttatgaag tgcatgggga ttttatcaac 120 

ttctccaaag fcgggttttaa ccgttcgcct attaaccctg ttaaaggtat ctatcctaca 180

gaaacttttg fctaaccttac gggtaagcta gaggggtctg tgcatttagg taggggatgg 240 

accgtgaatg taggcggtgt tttgggcgga caagtttatg ataacactag gtatgatagg 300 

tgggcaaagg attttacccc cccaagctat tgggataaaa cttcttgcgg cactgattct 360 

ttgagccttt cjtatgaatgc gactaaaatg tggcaacagc aagggccagg tggtatcatt 420

gaccctaggg gtattggcta tatgtatatg ggtgagtgga acggcttgtt ccctaattac 480

tatccggcta acgcctactt gcccgggcat tcaaggcgct atgaagttta taaagcgaat 540

cttacctatg acagcgacag agtccatatg gtaatggggc gctttgatgt taccgagcag 600

gagcaaatgg attggattta ccaattgttc caaggttttt atgggacttt caagcttact 660

aagaacatga aattcttgct ctttagctct tggggtcgtg gtatcgctga tggtcaatgg 720

ttgttcccta tctatcgtga aaagccttgg ggtattcata aggcgggtat tatttatcgc 780

cctacaaaga atctaatgat ccacccttat gtgtatctca tcccaatggt aggtacattg 840 

cccggtgcta aaatagaata cgataccaat cctgagttta gcggtagagg tataaggaat 900

aaaacgactt tctatgtgtt gtatgactat cgttggaata acgctgaata cggccgttac 960 

gcacccgctc gttataacac ttgggatccg ttcttggata atggtaagtg gcgtggcttg 1020 

caaggtcctg gtggtgcgac gctctatttg caccaccaca tagacattaa caactacttt 1080

gtggttggtg gtgcttacct caacatcggt aaccctaaca tgaacttagg tacttggggt 1140 

aaccctgtgg ctcttgatgg tatcgaacaa tgggtcggtg gcatctacag cttaggcttt 1200 

gcggggattg acaacattac cgatgctgat gcgttcactg agtatgttaa aggtggaggt 1260 

aagcatggta agttcagttg gagcgtttat cagcgcttca ctaccgcacc aagggctttg 1320

gaatatggta ttggtatgta tctagactat cagttcagca agcatgttaa agcgggtctc 1380

aaactcgtgt ggttagagtt ccaaatccgt gcgggttaca accctggaac cggtttcctt 1440 

gggccaaacg gtcagccgct caacttgaat aatggtttgt ttgaatcttc ggcgttcgcg 1500 

caaggccctc aaaacatggg tggtatcgca aaaagcatta ctcaagacag aagccatttg 1560 

atgacacaca tcagttatag tttctaa 1587 

<212> Type : DNA

<211> Length : 1587 

SequenceName : SEQ ID 501 
SequenceDescription :
Sequence

<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString : 

atgaaaaact tttccccact ctattgfcctt aaaaagctca aaaaacgcca tttaatcgct 60 

ctgagtctgc ccttgctttc ttatgcgaat ggctttaaaa tccaagagca aagcttgaat 120 

ggcacggctt taggctcggc gtatgtcgct ggggctaggg gtgctgacgc ttctttttac 180

aacccggcta acatgggctt tactaacgat tggggcgaaa acagaagcga atttgaaatg 240

accaccaccg tgatcaatat cccggccttt agctttaaag tccctacgac caatcaaggc 300 

ttatattcgg taacaagttt agaaattgat aaaagccaac aaaatatttt aggcatcatc 360 

aacactatag ggttaggcaa tatccttaaa gcgcttggca atacggccgc taccaatggc 420 

ttatcacaag cfcatcaatcg tgttcaaggg cttatgaact taaccaatca aaaagtcgta 480

accctcgctt caaaacctga cactcaaatc gtgaatggct ggacaggcac gactaatttt 540

gttttaccta aattctttta taaaacgcgc acgcataacg gcttcacttt tggggggagt 600

tttaccgctc ctagtgggtt gggtatgjaaa tggaatggta agggggggga atttttgcat 660 

gacgtgttta tcatgatggt agagcttgcc cctagcatga gttatactat taataaacgc 720 

ttttctgtgg gtgtgggttt aagggggctt tatgcgaccg ggagctttaa taacaccgtt 780

tatgtgcctt tagagggcgc ttcagttttg agcgcggagc aaatcttaaa cttacccaac 840

aatgtttttg ccgatcaagt gccaagtaac atgatgactt tattaggcaa tattggctac 900 

caaccagcgc ttaattgcca aaaagccggt ggggacatga gtgatcagag ctgtcaagag 960 

ttttacaacg gcttgaaaaa aatcatgggt tatagcggtt taatcaaagc gagcgcgaat 1020 

ctttatggca cgactcaagt cgtgcaaaaa tctaacggac aaggcgtatc gggggggtat 1080 

agagtgggtt cgagtttgcg tgtgttfcgat catggcatgt tttctgtggt gtataattct 1140

tcagttacct ttaacatgaa aggcggtttg gtggctatca cagagcttgg cccttcttta 1200 

gggagcgttt tgactaaagg cagcttgaat atcaatgttt cactccccca aactttaagc 1260 

ttagcctacg cccaccaatt ttttaaagat cgcctaaggg ttgaaggggt gtttgagcgc 1320 

actttttgga gtcaagggaa taaattfctta gtcacccctg attttgcgaa cgccacttac 1380

aagggcttga gcgggacggt ggcttccttg gactctgaaa cgcttaaaaa aatggtaggc 1440

ctagcgaatt ttaaaagcgt gatgaacatg ggggctggct ggagggacac caacaccttt 1500 

agattagggg taacttacat gggtaaaagc ttgcgtttaa tgggcgctat tgattatgat 1560 

caagccccaa gcccccaaga cgcgataggc attccggact ctaatggcta taccgtggct 1620 

tttgggacta aatacaattt taggggcrttt gatttgggcg tagcggggag tttcactttt 1680 

aagagcaacc gctccagttt gtatcaatcc ccaactattg ggcaattgag aatctttagc 1740

gcctctttag gctatcgctg gtaa 1764 
<212> Type : DNA 
<211> Length : 1764 

SequenceName : SEQ ID 502

SequenceDescription :

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 

<400> PreSequenceString :

atggcttttc aggtcaatac aaatatcaat gcgatgaatg cgcatgtgca atccgcactc 60 

actcaaaacg cacttaaaac ttcattggag cgattgagtt caggtttaag gatcaataaa 120

gcggctgatg acgcatcagg catgacggtg gcggattctt tgcgttcgca agcgagcagt 180 

ttgggtcaag cgattgccaa cacgaatgac ggcatgggga ttatccaggt tgcggataag 240 

gctatggatg agcaattaaa aatcttagac accgttaagg ttaaagcgac tcaagcggct 300

caagatgggc aaactacgga atctcgtaaa gcgattcaat ctgacatcgt tcgtttgatt 360 

caaggtttgg ataatatcgg taacacaacg acttataacg ggcaagcgtt attgtctggt 420

caattcacta acaaagaatt ccaagtaggg gcttattcta accaaagcat taaggcttct 480

atcggctcta ccacttccga taaaatcggt caggttcgta tcgctacagg cgcgttaatc 540 

acggcttctg gggatattag cttgactttt aaacaagtgg atggcgtgaa tgatgtaact 600

ttagagagcg taaaagtttc tagttdagca ggcacaggga tcggcgtgtt agcagaagtg 660 

atcaataaaa actctaaccg aacaggggtt aaagcttatg cgagcgttat caccacgagc 720

gatgtggcgg tccagtcagg aagttt^agt aatttaacct taaatgggat tcatttgggt 780

aatatcgcag atattaagaa aaacgactca gacggaaggt tagtcgcagc gatcaatgcg 840 

gtcacttcag aaaccggtgt ggaagcttat acggatcaaa aagggcgctt gaatttgcgc 900

agtatagatg gtcgtgggat tgaaatcaaa accgacagcg tcagtaacgg gcctagcgct 960 

ttaacgatgg tcaatggcgg tcaggattta acaaaaggct ctactaacta cggaaggctt 1020

tctctcacac gattagacgc taagagcatc aatgtcgttt cggcttctga ctcacagcat 1080 

ttaggcttca cggcgattgg ttttggggaa tctcaagtgg cagaaaccac ggtgaatttg 1140 

cgcgatgtta ctgggaattt taacgctaat gtcaaatcag ccagtggcgc gaactataac 1200

gccgtgatcg ctagcggtaa ccaaagcttg ggatctgggg ttacaacctt aagaggcgcg 1260 

atggtggtga ttgacattgc cgaatcagcg atgaaaatgt tggataaagt ccgctctgac 1320
ttaggttctg tgcaaaatca aatgattagc actgtgaata acatcagcat cactcaagtg 1380
aatgttaaag cggctgaatc tcaaatcagg gatgtggatt tcgctgaaga gagcgcgaat 1440 
ttcaacaaaa acaacatttt ggcgcaatca ggcagctatg cgatgagtca agccaatacc 1500 
gttcaacaaa atatcttaag gcttttaact tag 1533 
<212> Type : DNA

<211> Length : 1533 

SequenceName : SEQ ID 503

SequenceDescription : 

Sequence



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

atggcaggca caeaagctat afcatgaatca tcttctgcag gattcttatc gcaagtctcc 60

tcaatcatct caagcacaag tggtgtcgca gggccatttg caggaatagt agcgggcgct 120

atgacagcag cgattattcc tattgttgtg ggatttacta atccgcaaat gaccgctatc 180 

atgacccaat acaatcaaag catcgctgaa gctgtaagcg tgcctatgaa agccgctaac 240 

caacaataca accaattgta tcaaggtttt aacgatcaaa gcatggctgt ggggaacaat 300

atcttaaata tcagcaaatt aacaggggaa tttaacgcgc aaggcaacac gcaaagcgcg 360 

caaattagtg ctgtcaatag tcagattgca agcattttag cgagtaacac tacccctaaa 420

aatcctagcg ctattgaagc ttatgcgacg aatcaaatcg ctgttcctag cgtgccaaca 480

acggttgaaa tgatgagcgg tatattaggc aatattacaa gcgcagcacc aaaatacgcc 540 

ctagctctac aagagcaact gcgttctcaa gcaagcaaca gctcaatgaa tgatacagcc 600 

gattcccttg atagctgtac cgctttaggc gcacttgttg gctcatcaaa agtgtttttc 660 

agttgcatgc aaatttctat gactcctatg agtgtttcta tgcccactgt ttatgccaaa 720

taccaagcgg ttgccactaa agccctaact tcaggcgtta atcctatgac cactcctgca 780 

tgccctattg gggacaaggt tcttgccgtt tattgctatg ctgaaaaagt agcagaaatt 840

ttgagagaat actatataga afcttgtgaaa aacaatacca atttgttgca gaacgcttct 900 

caaatgatac ttaatcaatc aggattagct actagcacct atgacactca agcgatttct 960 

aacataagct cgctatataa ttacaatata gtagcgaata aatctttttt gaaatcgcat 1020

ttgacttatc ttgattacat caaagacaag cttaaggggc aaaaagatag ctacttaaca 1080

gaaagggtgc agactaaaat aatcgtgaag tga 1113 
<212> Type : DNA 
<211> Length : 1113

SequenceName : SEQ ID 504

SequenceDescription : 



Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString : 

atgactaacg aagccattaa ccaacaacca caaaccgaag cggcttttaa cccgcagcaa 60 

tttatcaata atcttcaagt ggcttttatt aaagttgata atgttgtcgc ttcatttgat 120 

cctaatcaaa aaccaatcgt tgataagaat gatagggata ataggcaagc ttttgagaaa 180 

atctcgcagc taagggagga attcgctaat aaagcgatca aaaatcctac caaaaagaat 240

cagtattttt caagctttat cagtaagagc aatgatttaa tcgacaaaga caatctcatt 300

gatacaggtt cttccataaa gagctttcag aaatttggga ctcagcgtta ccaaattttt 360

atgaattggg tgtcccatca aaacgatccg tctaaaatca acacccaaaa aatccgaggt 420 

tttatggaaa atatcataca accccctatc tctgatgata aagagaaagc ggagtttttg 480 

aggtctgcca aacaagcttt tgcaggaatt atcataggaa accaaatccg atcggatcaa 540

aaattcatgg gcgtgtttga tgaatctttg aaagagaggc aagaagcaga aaaaaatgga 600 

gagcctaatg gagatcctac tggtggggat tggcttgata tttttttatc atttgtgttt 660 

aacaaaaaac aatcttccga tctcaaagaa acgctcaatc aagaaccagt tcctcatgtc 720

caaccagatg tagccactac caccactgac atacaaagct taccgcctga agctagggat 780 

ttgcttgatg aaaggggtaa tttttctaaa ttcactcttg gcgatatgaa catgttagat 840

gttgagggag tcgctgacat tcjatcctaat tacaagttca accaattatt gatccacaat 900 

aacgctctgt cttctgtgtt aatggggagt cataatggca tagaacctga aaaagtttca 960 

ttgttgtatg gaaacaatgg tggtcctgaa gctaggcatg attggaacgc caccgttggt 1020

tataaaaacc aacgaggcga caatgtggct acactcatta atgtgcatat gaaaaatggc 1080 

agtgggttag tcatagcagg tggtgagaaa gggattaaca accctagttt ttatctctac 1140

aaagaagacc aactcacagg ctcacaacga gcattgagtc aagaagagat ccaaaacaaa 1200 

gtggatttca tggaatttct tgcacaaaat aatgctaaat tagacaactt gagcaagaaa 1260 

gagaaagaaa aattccaaaa tgagattgaa gattttcaaa aagactctaa ggcttattta 1320 

gacgccctag ggaatgatca cattgctttt gtttctaaaa aagacaaaaa acattfcagct 1380 

ttagttgctg agtttggtaa tggggaattg agctacactc tcaaagatta tgggaaSaaaa 1440

gcagataaag ctttagatag gcgaggcaaaa accactcttc aaggtagcct aaaacatgat 1500 

ggcgtgatgt ttgttgatta ttctaatttc aaatacacca acgcctccaa gagtcctgat 1560 
aagggtgtgg gtgctacgaa tggcgtttcc cattfcagaag caggcttteag caaggtagct 1620

gtctttaatt tgcctaattt aaataatctc gctatcacta gtgtcgtaag gcaggattta 1680

gaggataaac taatcgctaa aggattgtcc ccacaagaag ctaataagct tgtcaaagat 1740 

tttttgagca gcaacaaaga attggttgga aaagctttaa acttcaataa agctgtagct 1800 

gaagctaaaa acacaggcaa ctatgacgag gtgaaacaag ctcagaaaga tcttgaaaaa 1860

tctctaaaga aacgagagcg tttggagaaa gatgfcagcga aaaatttgga gagcaaaagc 1920 

ggcaacaaaa ataaaatgga agcaaaatct caagctaaca gccaaaaaga tgagattttt 1980

gcgttgatca ataaagaggc taatagggat gcaagagcaa tcgcttacgc tcagaatctt 2040 

aaaggcatca aaagggaatt gtctgataaa cttgaaaata tcaacaagga tttgaaagac 2100 

tttagtaaat cttttgatga attcaaaaat ggcaaaaata aggatttcag caaggcagaa 2160 

gaaacactaa aagcccttaa aggctcggtg aaagatttag gtatcaatcc agaatggatt 2220 

tcaaaagttg aaaaccttaa tgcagctttg aatgaattca aaaatggcaa aaataaggat 2280

ttcagcaagg taacgcaagc aaaaagcgac cttgaaaatt ccattaaaga tgtgatcatc 2340

aatcaaaaga taacggataa agttgataat ctcaatcaag cggtatcagt ggctaaagca 2400

acgggtgatt tcagtggggt agagcaagcg ttagccgatc tcaaaaattt ctcaaaggag 2460 

caattggctc aacaagctca aaaaaatgaa gatttcaata ctggaaaaaa ttctgcacta 2520 

fcaccaatccg ttaagaatgg tgtaaacgga accctagtcg gtaatgggtt atctaaagca 2580 

gaagccacaa ctctttctaa aaacttttcg gacatcaaga aagagttgaa tgcaaaactt 2640 

ggaaatttca ataacaataa caataatgga ctcgaaaaca gcacagaacc catttatact 2700 

caagttgcta aaaaggtaaa agcaaaaatt gaccgactcg atcaaatagc aagtggtttg 2760

ggtgatgtag ggcaagcagc gagcttcctt ttgaaaaggc atgataaagt tgatgatctc 2820 

agtaaggtag ggctttcagc taaccatgaa cccatttacg ctacgattga tgatctcggc 2880

ggacctttcc ctttgaaaag gcatgataaa gttgatgatc tcagtaaggt agggctttca 2940 

agggagcaaa aattgactca gaaaattgac aatctcaacc aggcggtatc agaagctaaa 3000 

gcaagtcatt ttgacaacct agatcaaatg atagacaagc tcaaagattc tacaaaaaag 3060

aatgttgtga atctatatgt tgaaagtgca aaaaaagtgc ctactagttt gtcagcgaaa 3120 

ttggacaatt acgctactaa cagccacaca cgcattaata gcaatgtcaa aaatggaaca 3180 

atcaatgaaa aagcgaccgg catgctaacg caaaaaaatt ctgagtggct caagctcgtg 3240 

aatgataaga tagttgcgca taatgtggga agtgctcctt tgtcagcgta tgataaaatt 3300

ggattcaacc aaaagaatat gaaagattat tctgattcgt tcaagttttc caccaggttg 3360

agcaatgccg taaaagacat taagtctggc tttgtgcaat ttttaaccaa tatattttct 3420 

atgggatctt acagcttgat gaaagcaagt gtggaacatg gagtcaaaaa tactaataca 3480 

aaaggtggtt tccaaaaatc ttaa 3504 
<212> Type : DNA
<211> Length : 3504

SequenceName : SEQ ID 505

SequenceDescription :



Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString : 

atgaaaacaa atggtcattt taaggatttt gcatggaaaa aatgcttttt aggcgcgagc 60 

gtggtggctt tattagtggg gtgtagcccg catattattg aaaccaatga agttgctttg 120

aaattgaatt accatccagc tagcgagaaa gttcaagcgt tagatgaaaa gattttactt 180

ttaaggccag ctttccaata cagcgataat attgxrtaaag agtatgaaaa caaattcaag 240 

aatcaaacca cgcttaaagt tgaagagatc ttgcaaaatc agggctataa ggttattaat 300 

gtggatagca gcgataaaga cgatttttct tttgcgcaaa aaaaagaagg gtatttggct 360 

gtcgctatga atggcgaaat tgttttacgc cccgatccta aaaggaccat acagaaaaaa 420

tcagaacccg ggttattatt ctccactggt ttggataaaa tggaaggggt tttaatcccg 480 

gctgggtttg tcaaggttac catactagag cctatgagtg gggaatcttt ggattctttt 540 

acgatggatt tgagcgagtt ggacatccaa gaaaaattct taaaaaccac ccattcaagc 600

catagcggag ggttagttag cactatggtt aaggggacgg ataattctaa tgacgcaatt 660 

aagagcgctt tgaataagat ttttgcaagt atcatgcaag aaatggataa gaaactcact 720 

caaaggaatt tagaatctta tcaaaaagac gccaaggaat taaaaaacaa gagaaaccga 780 

taa 783 
<212> Type : DNA 
<211> Length : 783 

SequenceName : SEQ ID 506 

SequenceDescription : 



Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgaaatcga agctaaagtt aaaacgttat ttactgtttt taccactttt accgctaggg 60 
acgttgtcac tagccaacac ctacctcctc caagaccaca acaccctcac cccctacacg 120 
ccctttacga caccgctcaa tggggggctg gatgtcgtgc gcgccgccca tttacacccc 180
tcatacgaac tcgtggactg aaagcgggtg ggggatacca agttggtggc gctggtccgc 240
tcagcgttgg tcagggtgaa attccaggac acaacgagtt cggatcaaag taataccaac 300
caaaatgcct tgagttttga tacccaagaa tcacagaagg cacttaatgg ctcgcagagt 360
ggatcttctg acacttccgg gtctaactcc caagacttcg ccagctatgt cctcatcttt 420
aaagccgcgc ccagggccac gtgggtgttt gaacgcaaga ttaagttggc gttgccctac 480
gttaagcagg aaagtcaggg ttccggcgat caaggttcca atggtaaggg ctccctctac 540
aaaaccctcc aagacctcct cgtcgaacaa cccgtgaccc cttacacccc gaatgcgggg 600
ttagcccggg tgaatggggt tgctcaggat acggttcatt ttggttcggg tcaagaatcg 660
agttggaatt cccaacgttc ccaaaaaggc cttaaaaaca accccggacc caaagccgtc 720
accggcttta agctcgataa gggccgcgcg taccggaagc tgaatgaaag ttgaccggtg 780
tatgaacccc tggattcgac caaggagggg aaggggaagg atgagagctc ttggaaaaat 840
tcggaaaaaa caacagcgga aaatgatgcc ccgttggtgg ggatggttgg aagtggtgcg 900
gctggaagtg cttctagttt acaacgcaat ggctcgaaca gtfccggggtt aaaatcgctc 960
ttgagatcag cacctgtcag tgttccacca agcagtacaa gtaatcaaac tttaagctta 1020
tctaaccccg ctcctgtggg cccacaagcg gttgtaagcc aacccgcggg gggtgctacg 1080
gcagcagtgt ccgtcaatcg cacagcgagt gacaccgcca cctttagcaa gtacctcaac 1140
accgcccagg ccttgcacca gatgggggtg attgttccgg ggttggaaaa atgaggtggt 1200
aacaacggta cgggtgtagt ggctagccga caggatgcta cttccactaa cctgccccat 1260
gcggcaggtg cttcccaaac gggtttggga actggtfccgc cccgcgaacc agctttaacc 1320
gcaacgtcac agcgtgccgt cacggtggtt gctggccccc ttcgtgcggg caatagcagt 1380
gaaactgatg ccctaccgaa tgtcatcacc cagctctatc atacttcaac cgcccaactc 1440
gcttacttaa atggccagat cgttgtgatg ggttccgacc gggtaccgag tctttggtat 1500
tgagttgtcg gggaggacca ggaatcgggc aaagcgacct gatgagcgaa aaccgagctc 1560
aactggggca ccgacaagca gaagcagttt gtcgaaaacc agttggggtt taaagatgac 1620
tcaaattcgg attccaaaaa ttcgaatttg aaggcccaag gcctcaccca acccgcctac 1680
ctcatcgccg gtcttgacgt tgtggccgac cacctcgtct ttgcggcctt taaagcgggc 1740
gcggtggggt atgatatgac gactgattcg agcgcttcga cctacaacca agcactcgcc 1800
tggtcgacca cggccgggtt ggacagtgat ggggggtaca aggccttggt ggaaaacacg 1860
gccgggctca acggcccgat taatggcttg tttaccctgc tcgacacctt tgcgtatgtg 1920
acccccgtga gtgggatgaa aggggggagt cagaataatg aagaagtgca aacgacttac 1980
ccggtcaagt ccgaccaaaa ggccaccgcc aaaattgcct ccttaattaa tgccagccca 2040
ctcaacagtt atggggatga tggggtgacc gtgtttcjatg ccctgggcct taactttaac 2100
tttaagttga acgaggagcg cttgccatcg cgcaccgacc aactgcttgt gtatgggatt 2160
gtaaacgaaa gtgaactgaa gtccgcacgg gaaaatgccc agtcgacctc cgatgataat 2220
tcaaacacca aagtcaagtg aaccaacacc gcctcgcact acctccccgt gccgtattac 2280
tacagtgcca atttccccga agcgggtaac agaaggcgag cggagcagcg gaatggggtg 2340
aagafctagca ccttggaatc gcaagccact gatggctttg ccaactcgtt acttaacttt 2400
ggtaccggtc ttaaagccgg tgttgaccca gctccagtag cacggggtca taaaccgaac 2460
tatagtgcag tactactagt gcgtggtggc gttgtaaggt taaactttaa ccccgatact 2520
gataaactgt tggattctac tgacaaaaac agtgaaccta tctccttctc ctatacccca 2580
tttgggtctg ctgaaagtgc cgtagacctc accacgttga aggatgtgac ctatattgct 2640
gaaagtggtc tgtggttcta tacctttgac aatggtgaaa aaccaacgta cgatggtaaa 2700
caacaacagg tcaaaaaccg caagggttat gctgtgatta ccgtatcacg taccggaatt 2760
gaatttaacg aggacgctaa taccacaacc ttaagccaag ccccagctgc tttggctgtc 2820
caaaacggga ttgcttccag tcaggacgac ctcacaggca tcctaccgtt atccgatgag 2880
ttctccgctg tgattaccaa ggatcaaaca tggaccgrgta aggttgatat ctataagaac 2940
accaacgggt tgtttgaaaa ggatgatcag ctatcgcjaaa acgtgaagag gcgtgacaac 3000
ggtttggtcc ctatttacaa cgaaggtatc gtcgatattt ggggcagagt ggattttgct 3060
gccaacagtg ttttgcaagc gcgtaacctc actgataaaa cggttgatga ggtgatcaat 3120
aaccccgata tcctccaaag cttctttaag tttaccccag cctttgataa ccaaagagca 3180
atgctagtgg gggaaaagac atcggatact accttaacgg ttaaaccgaa gattgagtac 3240
ttggatggta acttctatgg tgaggattcc aagattgctg gaattccgct caacattgat 3300
ttcccttccc ggatttttgc tggctttgct gctttaccgt cctgggtcat tccggtatca 3360
gtcggttcat cggtgggcat tctcttaatc ctgctcatct taggccttgg tattggaatt 3420
ccaatgtata aggtccgcaa gcttcaagac tccagctttg ttgatgtgtt taaaaaggtg 3480
gatacgttga caaccgctgt gggtagcgtg tacaagaaga ttatcaccca aacgagtgtg 3540
atcaaaaaag ctcctagtgc gttgaaagct gctaataacg ctgctcctaa agcaccagtt 3600
aaaccagctg ctccaacagc tccaagacca ccagtccaac cacctaaaaa ggcttaa 3657

<212> Type : DNA

<211> Length : 3657

SequenceName : SEQ ID 507
SequenceDescription :



Sequence 
<213> OrganismName : Mycoplasma pneumoniae
<400> PreSequenceString : 

atgcaccaaa ccaaaaaaac tgccttgtcc aagtccactt ggattctcat cctcaccgcc 60 

accgcctccc tcgcgacggg actcaccgta gtgggacact tcacaagtac caccacgacg 120

ctcaagcgcc agcaatfctag ctacacccgc cctgacgagg tcgcgctgcg ccacaccaat 180

gccatcaacc cgcgcttaac cccgtgaacg tatcgtaaca cgagcttttc ctccctcccc 240 

ctcacgggtg aaaatcccgg ggcgtgggcc ttagtgcgcg acaacagcgc taagggcatc 300 

actgccggca gtggcagtca acaaaccacg tatgatccca cccgaaccga agcggctttg 360 

accgcatcaa ccacctttgc gttacgccgg tatgacctcg ccgggcgcgc cttatacgac 420 

ctcgattttt cgaagttaaa cccgcaaacg cccacgcgcg accaaaccgg gcagatcacc 480

tttaacccct ttggcggctt tggtttgagt ggggctgcac cccaacagtg aaacgaggtc 540 

aaaaacaagg tccccgtcga ggtggcgcaa gacccctcca atccctaccg gtttgccgtt 600

ttactcgtgc cgcgcagcgt ggtgtactat gagcagttgc aaagggggtt gggcttacca 660
cagcagcgaa ccgagagtgg tcaaaatact tccaccaccg gy^aatgtt tggcttgaag 720

gtgaagaacg ccgaggcgga caccgcgaag agcaatgaaa aactccaggg cgctgaggcc 780

actggttctt caaccacatc tggatctggc caatccaccc aacgtggggg ttcgtcaggg 840 

gacaccaaag tcaaggcttt aaaaatagag gtgaaaaaga aatcggactc ggaggacaat 900

ggtcagctgc agttagaaaa aaatgatctc gccaacgctc ccattaagcg gagcgaggag 960 

tcgggtcagt ccgtccaact caaggcggac gattttggta ctgccctttc cagttcggga 1020

tcaggcggca actccaatcc cggttccccc accccctgaa ggccgtggct tgcgactgag 1080

caaattcaca aggacctccc caaatgatcc gcctcgatcc tgattctgta cgatgcgcct 1140

tatgcgcgca accgtaccgc cattgaccgc gttgatcact tggatcccaa ggccatgacc 1200

gcgaactatc cgcccagttg aagaacgccc aagtgaaacc accacggttt gtgggactga 1260

aaggcgcgcg atgttttgct ccaaaccacc gggttcttca acccgcgccg ccaccccgag 1320

tggtttgatg gcgggcagac ggtcgcggat aacgaaaaga ccgggtttga tgtggataac 1380

tctgaaaaca ccaagcaggg ctttcaaaag gaagctgact ccgacaagtc ggccccgatc 1440

gccctcccgt ttgaagcgta cttcgccaac attggcaacc tcacctggtt cgggcaagcg 1500

cttttggtgt ttggtggcaa tggccatgtt accaagtcgg cccacaccgc gcctttgagt 1560

ataggtgtct ttagggtgcg ctataatgca actggtacca gtgctactgt aactggttga 1620

ccatatgcct tactgttctc aggcatggtc aacaaacaaa ctgacgggtt aaaggatcta 1680

ccctttaaca ataaccgctg gtttgaatat gtaccacgga tggcagttgc tggcgctaag 1740 

ttcgttggta gggaactcgt tttagcgggt accattacca tgggtgatac cgctaccgta 1800 

cctcgcttac tgtacgatga acttgaaagc aacctgaact tagtagcgca aggccaaggt 1860 

cttttacgcg aagacttgca actcttcaca ccctacggat gagccaatcg tccggattta 1920 

ccaatcgggg cttgaagtag tagtagtagt agtagtcaca acgcacccta ctacttccac 1980

aataaccccg attgacaaga ccgtccaatc caaaatgtgg ttgatgcctt tattaagccc 2040 

tgagaggaca agaacggtaa ggatgatgcc aaatacatct acccttaccg ttacagtggc 2100 

atgtgagctt gacaggtata caactggtcc aataagctca ctgaccaacc attaagtgct 2160 

gactttgtca atgagaatgc ttaccaacca aactccttgt ttgctgctat tctcaatccg 2220 

gaattgttag cagcfccttcc cgacaaggtt aaatacggta aggaaaacga gtttgctgct 2280

aacgagtacg agcgctttaa ccagaagtta acggtagctc ctacccaagg aacaaactga 2340 

tcccacttct cccccacgct ttcccgtttc tccaccgggt tcaaccttgt ggggtcggtg 2400

ctcgaccagg tgttggatta tgtgccctgg attgggaatg ggtacaggta tggcaataac 2460 

caccggggcg tggatgatat aaccgcgcct caaaccagcg cggggtcgtc cagcggaatt 2520 

agtacgaaca caagtggttc gcgttccttt ctcccgacgt tttccaacat cggcgtcggc 2580

ctcaaagcga atgtccaagc caccctcggg ggcagtcaga cgatgattac aggcggttcg 2640 

cctcgaagaa ccctcgacca agccaacctc cagctctgaa cgggggcggg gtgaaggaat 2700

gataaggctt caagtggaca aagtgacgaa aaccacacca agttcacgag cgctacgggg 2760 

atggaccagc agggacaatc aggtacctcc gcggggaatc ccgactcgtt aaagcaggat 2820

aatattagta agagtgggga tagtttaacc acgcaggacg gcaatgcgat cgatcaacaa 2880

gaggccacca actacaccaa cctccccccc aacctcaccc ccaccgctga ttgaccgaac 2940

gcgctgtcat tcaccaacaa gaacaacgcg cagcgcgccc agctcttcct ccgcggcttg 3000

ttgggcagca tcccggtgtt ggtgaatcga agtgggtccg attccaacaa attccaagcc 3060 

accgaccaaa aatggtccta caccgactta cattcggacc aaaccaaact gaacctcccc 3120

gcttacggtg aggtgaatgg gttgttgaat ccggcgttgg tggaaaccta ttttgggaac 3180

acgcgagcgg gtggttcggg gtccaacacg accagttcac ccggtatcgg ttttaaaatt 3240

cccgaacaaa ataatgattc caaagccacc ctgatcaccc ccgggttggc ttgaacgccc 3300

caggacgtcg gtaacctcgt tgtcagtggc accacggtga gcttccagct cggcgggtgg 3360 

ctggtcacct tcacggactt tgtcaaaccc cgcgcgggtt acctcggtct ccagttaacg 3420 

ggcttggatg caagtgatgc gacgcagcgc gccctcattt gggccccccg gccctgagcg 3480

gcctttcgtg gcagttgggt caaccggttg ggccgcgtgg agagtgtgtg ggatttgaag 3540 

ggggtgtggg cggatcaagc tcagtccgac tcgcaaggat ctaccaccac cgcaacaagg 3600 

aacgccttac cggagcaccc gaatgctttg gcctttcagg tgagtgtggt ggaagcgagt 3660 

gcttacaagc caaacacgag ctccggccaa acccaatcca ctaacagttc cccctacctg 3720 

cacttggtga agcctaagaa agttacccaa tccgacaagt tagacgacga tcttaaaaac 3780

ctgttggacc ccaaccaggt tcgcaccaag ctgcgccaaa gctttggtac agaccattcc 3840 

acccagcccc agccccaatc gctcaaaaca acgacaccgg tatttgggac gagtagtggt 3900
aacctcagta gtgtgcttag tggtgggggt gctggagggg gttcttcagcg ctcaggtcaa 3960

tctggcgtgg atctctcccc cgttgaaaaa gtgagtgggt ggcttgtggcg gcagttacca 4020

agcacgagtg acggaaacac ctcctccacc aacaacctcg cgcctaatac taatacgggg 4080

aatgatgtgg tgggggttgg tcgactttct gaaagcaacg ccgcaaagat gaatgacgat 4140

gttgatggta ttgtacgcac cccactcgct gaactgttag atggggaagcg acaaacagct 4200

gacactggtc cacaaagcgt gaagttcaag tctcctgacc aaattgactt caaccgcttg 4260

tttacccacc cagtcaccga tctgtttgat ccggtaacta tgttggtgta tgaccagtac 4320

ataccgctgt ttattgatat cccagcaagt gtgaacccta aaatggttccg tttaaaggtc 4380

ttgagctttg acaccaacga acagagctta ggtctccgct tagagttcttz taaacctgat 4440

caagataccc aaccaaacaa caacgttcag gtcaatccga ataacggtgst cttcttacca 4500

ctgttaacgg cctccagtca aggtccccaa accttgttta gtccgttfcaa ccagtgacct 4560

gattacgtgt tgccgttagc gatcactgta cctattgttg tgattgtgct cagtgttacc 4620

ttaggacttg ccattggaat cccaatgcac aagaacaaac aggccttgaa ggctgggttt 4680

gcgctatcaa accaaaaggt tgatgtgttg accaaagcgg ttggtagtgt ctttaaggaa 4740

atcattaacc gcacaggtat cagtcaagcg ccaaaacgct tgaaacaaac cagtgcggct 4800

aaaccaggag caccccgccc accagtacca ccaaagccag gggctcctaa gccaccagtg 4860

caacgaccta aaaaacccgc ttag 4884
<212> Type : DNA
<211> Length : 4884

SequenceName : SEQ ID 508

SequenceDescription :



Sequence 

<213> OrganismName : Mycoplasma pneumoniae
<400> PreSequenceString :

atgggttata agttaaagcg atgaccttta gttgcgttta catttaccgg aattggccta 60
ggggtggtgc tagcggcgtg ttctgcactc aatacctcca atctgtttcc ccgtcaaaac 120

cgctccaagc agttgattgg gtttaccgaa aacaacatca ttaaacctga agctgtacta 180

aaagccgctt tagctgagga caatggtacg gaaaccattt taagggttaa ctttggtgaa 240
gcgttaaaga gttggtacca aaacaataag gatcgcaaca ttgctacccrcj tttaactatc 300

tttagtgaaa acgtggagga tgaacatgat aacctgttgg accaaaaaca acaggccgaa 360 
ccgattaatt gaccaataga gctccaaaag gaatacgacc agtggggtgg gagtgaaagt 420 
tcctgaaagg ccttaaagtt atacgaccgt ttaatagccg acttccaatc acttatcttt 480 

35 agcaacattg ttgctaatgt gcagttgact gatggtagtg atcagtttaa acccactacc 540 
aaagataacc tcgacagtac tagcaacaaa atcaagtttg ttaattccaa accaaatgat 60 0 

cctaacgggg agttctttgc taacctccag gcttatctat ttgcccagfcg agtagtggag 660 
gaaaaccccc tccccctcac ccaagccttc ttcgcgtacc aagcacccaa ggacgggctc 72 0 

gacagcttat acgaccaagc cgcgattggt agtgccttac agttgggcfca tgccttcccc 78 0 

gcgtttcgcg agcccaataa tggacaaagc caaggtaaaa ccaccttcgja ccccaccccc 840
aacagtgccc aaaacttcgg ggatttcatt aaggcggtgt ttccggagoa gaagaatggt 900

caaacccaac aatccaatac atcctcccgc accgggttgt ttgactggca aaccaagtga 960 

aacaccaatg gcgctgccaa taagctcttg gtgaccaagt ccaacctgcg cggggccttt 1020

aagggcgtgg ggttagctac ggccattatt gaccagtacg agtacctgcjt gggtggaagt 1080

aaaacgagtt cgttgcccga ggtcaaggta gattcgaata agtccaacca aaatccgctg 1140

gatagttttt ttatggaggg gaaggatgcg gtggcgatta gatccatagt ttcgcgtgcg 1200

aaaattgcca tgaccgacca aacccccggg tttaaggtca acccggcgtt tgtcaaggtg 1260

aagcagagtc agcagaacga taccttttac caaaaccagc gcaaactgag cggcgggcaa 1320

agtggggata ataactccca gggtaaacac cactacctcc aagacgcggt gcggttgacg 1380

agtagtcaag caatggcggc agcttctact ggggcagatt ctagtagtgg taccaatgtt 1440

ggtggtagct ccgggggcaa ctctgtttta attccgctcc caaggagtgc cgcccttacc 1500

cacacccagc aacaagttca acaaaccacc tccacactcc aaacccctgt ttacgcccgt 1560

ggtgacgatg gtacatacgc cctggcaatt gacggtggtg attatttttt ggcgaacaac 1620 

aaacgcgatt tcaccaaaca agccgacatc ttgctgtacc gctatttaca agctaagagt 1680 

aataacttta aggaaaacgg cgtcgagttt agtttgaact tactggaatc tggcagcctg 1740

ttccaaacgt gagcacaaac ggggttaacc gcgaagttgt acggtgctfct ggtggcgatg 1800

atgggttcgg gtcaggggac ccaagttaag ggcagtgtgc aggggagtaig ccgagcagct 1860

tctgttagcg ttcaaactac ccagcaaaac agacagcaat cgactgatac ccaagaatcg 1920 

gaagtggtta agttagccaa gtccctgctg aaatcaagtg cggattfcggc caaacccttc 1980 

actgataacc ccacctttaa aaaggcctta accgacatcc aatccgagta caaggattat 2040

ttggctgctg cggggaagtt aagtgagttt aagaaagatt tgggggagcgfc gagtggctta 2100 

cagcaagcta ttattgaccg cgctgataag tacattcaac tagaaaaaca agcccaaaag 2160 

agtgcgattg gcttgggcca accgctccct taccaacgcg ctagtgatcjg tagttacccc 2220

gctttagaga agttctttat tccggaagat agtgcagcag atggaaaacgt aaaagcatct 2280

gaatcaggat ctgccgcttt agtaacacta aagacaacag atagtcaaaa aagcacaaac 2340

accgttaaac aaccggacat caaaccgacc cgggaaaaca acgacaaaaa gttaaagcag 2400 

ttaactagtg atgtagaaac taaagcatct agtttaatca ccaaatgacgg tgcaacaccc 2460 
caaattggta gtcagttctc cgaaattgtt tcgctcaaga gtaaggacaa taagccccag 2520

accaacatga tcttggcctt attgtctgat gttgggatta agtgaaccaa gattctcaat 2580

tcctttaagg agtggttctt tactaatacc aacgacttta agaacaatta cgattctgaa 2640 

aagaaagaac ttaaaggcaa tgagtacaag gactttaacg acctcgttaa acaaaccttg 2700 

tacttacgtt cctgacaacg actcacttcc aaggaaaagt fctggttatta caaggagttg 2760

ggtagtgtaa aagcacaagc agctcaatca ggaatggttt cactttcaag tagcgcagca 2820

gttgcaaatg ctgtagcttc aagtggcatg caaaaatctg gtgatcaaac cctcttggaa 2880 

ctcggtaaaa aagcgtttga aagtgaactg gaagcatcta gtagtgatgg gcagtacaag 2940 

tacctgcgct tcttgtccac cttaatgtga ctagttaagg atggggcgaa gaactataag 3000

cgcttgctcc aacaagcgat taccgtgggt acccgtgcct fctgtgtcgtg aacggtgagc 3060

tatgatgata cggctacggc atcagcagcc gcagcaaaag cgcaagtagc agtattaaag 3120

acagctcagg caacaaatac tcaaagcgac aaccccttca ataagtttgt ccaaaacccc 3180

gattatgtcc agggttcgga aaccaactga tttaacgata agagtacgcc aattaaaccc 3240 

gattcgttac tggaaagcga aagcacttac aactttaccg cggaaccgtt tgatgataag 3300

acgaaaagtc aaaagaggtc tactggtggt accactaacg aaaaacactt ctttggcttt 3360

aacggtttaa cgattaactc gccccaaagt gtgtcgactg ctagtgccgg actcacggaa 3420 

caaatcttta acaactttgg tcagttagta acaagtagtg acaaatccgg ggccttgagc 3480

cagtacaagg ataaagcaac cctgaaacgc ttaatccaaa acactaattc cgatgccgag 3540

ctcaatgcct ttggtgaagt gttgcaccgt gcagttaatg fccgataccag taacttagga 3600

cgttttaact ccagtgggga accgttgatt agctttgaca acaagaagaa gttcttggtt 3660

gatgtagtgg acaaactgga tgatgtttac tttaacaagt fccgaaggtta tgttggccaa 3720

accaaggtga agatgtctga ttcaagtagt agttcccaag cytacgaagac tataagaaag 3780

ccaaagccgc accactcacc aaggactcga gtgtcaagac fcttgagctat gtcattcaga 3840

ttaccaacga ggacgttaac aaaatttctg ctagttgaaa aactgataag aacagtgctt 3900 

tag 3903



<212> Type : DNA 

<211> Length : 3903 

SequenceName : SEQ ID 509 
SequenceDescription :

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString :

atgaaaaaac ttctaataaa accccagttt tgattcttaa cccttggtgg ctttatcfcct 60

tcaagtgtta ttttagttgc ctgtgctacc ccttccaatt cagcactcca aactgttttt 120

aaagcacgtt cgaaccagtt cttcaacggg gaacagggta gtttacaaaa cgctttagcc 180

actgctttaa aagatccaga ggccaacaaa cagtttgtcg ctgctcccct tttaaaggca 240

ttgacagctt ggtatgaaaa taaccaagac aaacaggtca cccagttctt taaagacacc 300

aaaaagagtg ttgatgagca atacaaccaa gccgtagata aggtggtatc ggctagccga 360

aacaaaaacc tctttgtcca acaagacttg ctggatagcg ccgggggggt tagaaacttg 420

aaaagcccag aagttgtttg aacagctcat taa 453
<212> Type : DNA 
<211> Length : 453 

SequenceName : SEQ ID 510

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae
<400> PreSequenceString :

atgcaacagc aaggagaaac taaagatcaa tataacactt ttggcctgag acttgtgcgt 60 

aacagtgttg gcgtatcagt tttaggactt gatggttttg ttaaattcat aaaaggcggt 120 

agcggaggta gcaatggcgg ttctagtagt gccaaaaaga ttgataaaga agagcaaaag 180 

aaattcttaa agttccgtgc ttttcaagcc aaaattggca cattttataa cactaacttt 240

gcctttagtt ttcccctaaa cgaaacttta aaaggttggt ttgacaaaca ccgcggattg 300

atcttagcga acgccttagt taaagttact ttagacacaa aggaaaaagc aagtaaggcc 360

ttagtggatg cttttagttc ttataaaaac tgattgagtg aatacacccc ggttggttta 420

gctaccacca tgattagctt ttattttgac caaatgaaag ctctcaacaa taagctgtta 480

gaacgagtac ggagcttaaa ccaaaatgtt aaccaagcca atcctacccc ttggttaaat 540

gggttatcgg ctaaactacc ttacgttaac actaacggta attacgaaaa gttaaacaac 600

tactttacct ttttaattac caaagttttg tgacctaagg tgggcactga ggatactaat 660 

gtcagtgaag aaaaaagcaa actcaaaact aaaactgaag atgttaacaa gattagggaa 720

aagattttga acaacatcga cagtaagctc aaaacttttg tccaaaaact caaacctacc 780

ttagcacccc gaccagctta cagtaacgta atcttgttaa acattaacaa tgataaggtg 840

tggtctgctg gcgctaattg aagtttggca gttttactag atccgaaaaa ggttaacccg 900

ctttcgttta tgttgctcaa acaaatgttt gatcaaaaca gtttgtttaa aaaagcaaag 960 
actttattcg aaaacattca aaataaagca aaaactagtg gaagtggtaa aagtggtaca 1020

accaccaacg atgatgccga tgctttgagc aaagtcattg gcaactatta ctacaacact 1080

tgagctaagc taaccgataa atcaatttat ggcaacctta aggatgacaa atttgatgat 1140

ctctttaaat tggcttttga cagtagtatt aacgaaaagt cctttaatgt agattataag 1200

gcagtgattg aacactaccg ctttatctat accttagagt ggttggtaga caaaaaccta 1260

aagaacttca aggatttatt aaaggcaaac ctcaagtttg gtgaaattgc ttttafcfcgct 1320

tacaagaata ctgaaacgca gaacttctct aacccgcaag gtatattcgg ttcttacttt 1380

aattacgaaa acgaaacgaa tgcggctaag agtgctacgc aaattataga ccccaacagt 1440

ttcttctata aaaccaccac taaaccagaa gcgaaaacca ctcaaagtgc taatacragcfc 1500

gtgatggttc aaaatacgca gatgaacaat cagcaaacta acagttatgg ctttacfcggt 1560

ttgagtacca gtagtggttc gatgttaggt gctgctaccc agcaagccat tttggatcaa 1620

ataaccaaaa cttccttgca acagtatggt tctcaagctg acctgaaaaa gatcattggc 1680

gaaactaaaa atcaattatt attagaccga attgccaacc aactaatagc cttaaaaccg 1740

aatacaagtg gcaacagtgg tacccaaaaa acaattgctg catacttcca aacagatgcg 1800

gttggcaatc ctactttgga ctttaaagcg aagcaaaaac tcttattgga tgttttagat 1860

caatacaaag atttctttgg taataatgca caagcagttc aaagagattc tggtaagagt 1920

ggaactggca actatttaac ctataccgac ggtagtgata agatcactta tttgcagttt 1980

tcctataaag atattgacgg tttaagtttg agtagttcaa acggaactag cagcaaattt 2040

gccagtgatg ttgtagcagc gcttttatta ttccaggcag cctataaagg tactcaacaa 2100

ctggctttaa gttccatcaa taaaccacaa ttaccaattg gcgataaacg cataaaaaca 2160 

gggatcgatt tactgaaata g 2181 
<212> Type : DNA 
<211> Length : 2181 

SequenceName : SEQ ID 511 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgaagagtt ttttaagaaa acccaagttt tgacttttat tgttgggtgg cctttccact 60 

agtagtatta ttctcagtgc ctgtgcaaca ccttcaaact cagcactgca agcagttttt 120 

aaaccaactt ccaaccaatt ttttaacggt gagcatggta ccattcaaag cgctttaaac 180 

accgctttaa gagatccgga aactaacaaa aaatttgtag ctgctccact tttaaaagca 240 

ttagaagctt ggtacgagaa taatcaagat aaaaacatta ctcaattctt aaaagacacc 300

aagactaatg ttgataacca atacaaaacc gttgtagata aagtagtttc agcaccacgc 360

aataaatcct tatttgtaca acaagattta ttggacagta gcggtggtag tgaggcrcact 420

tgaaaagcgc gcaaactgtt tgaacagcta attagtgatt ttgcttcacg agtttfctcaa 480

aagaattatt tgtcttataa ggaaaatggc aaagtatcag ccggtccgtt tttatacgac 540

acgatttcaa aaaatagtaa ttgacagaat atagtttttg atgctgttaa ttttccragaa 600

actaatgatg attttttcgc gaagattcaa agcgaagttt ttgatcaatg ggctgagtac 660 

accgatccaa ctattattag ctcagttacc ttaaagtatt cagctcctaa ttaa 714 

<212> Type : DNA 
<211> Length : 714 

SequenceName : SEQ ID 512 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString :

atgattaact tcttgtttaa ccaaatgaac gcgttgaaca ataagttttt ggagcgcgct 60 

aaagcattaa accaaaatgt gaatcaggcc aatcctacac cttggttaaa tggtttatca 120

gctaaattgc cttatgttag aacaaatggt aattacgaaa aactcaacaa ttacttcact 180

ttcttaattg ttaaatacat gtggaagaag gtcggtaatg aagacgcttc attatctaaa 240

gatagtagca ttaacaagct caaaactaag accgaagacg ttaacaaaat tagagataaa 300

atcttagaag acattcaaaa gaaggttcaa gaatttgtta aaaacaagct taaaccaacc 360

ttagcaccac gacaaactta cagtaatgta attttgttaa acgttaacaa cgataa^gtt 420 

tgatcgatgg gcgccaattg agctttagct aacttattgg atacgagcaa aattaatcca 480 

ctttcattta tgttgctcaa acaaacgttt gaccaaaacg acttgtttaa gaaagctaag 540 

aaactttttg aagatattca aagtaaaaca aatggtggaa gttcaggtgg gatgcasiggt 60 0 

agcaatacct cgagtagtga aggagctgat gccttgagta aagtaattgg caactattac 660 

tacaacagtt gagctaaatt gactgataaa tccatttatg gaaatcctaa ggacaacaaa 720 

tttgatgact tatttaaatt agcttttgaa gatagtatta acgaaaagtc ctttaatgtt 780 

gattacaaag cggtcattga acactaccgc tttatctata ccttggaatg actagttaac 840 

ggtaacttaa agaactttaa ggacttatta aaagcaaatt taaagtttgg tgaaattgct 900 
ttcattgctt ataaaaacac tgaaaccaaa gaattttcca atccgcaagg tgtattfccjgt 960

tctgctttta attacgaaaa cgaaactaat gaagttaaga ttgctgcgca aaacttagac 1020

ccaaataatt tcttttataa aacaactacc aaaccagaag aagttaaaac cgcacaaaat 1080

ggtgcaagca tgatggttat gcaacaaaaa atgcaaagta ccatgcaaga tagtaaccat 1140

tatggtttta ccgggctaaa caccagtact agttcaatgt taggtgccgc tacgcagcaa 1200

gccattttgg atcaaataac caagaattcc ttgcaacaat atggttcgca gcaagaactg 1260

aaaactttga ttgaaaagac taataaccaa cttttattag accgaatcgc tagtcaatta 1320

agtggattaa atccttcaac cactggaaat agtaataatg gaaagggtaa aaatattgct 1380

acttatttcc aattagatgc cattggcaac ccaaccctta gctttcaaca aaaacgaaaa 1440 

cttttattag atgttttaga tcagtacaaa gatttctttg gtacaaatac tcaggctgca 1500

caaagagatt ctggtaaagg tggacatgga agctattcaa cttaccaaga tggcagtgac 1560

aagatcactt accttcagtt ctcctataaa gatatagata acttaagttt gagtgataaa 1620

ggaaatagta agcttgctag cgatgttgtg gctgcccttt tacttttcca agcagcggat 1680

aaaggtaccc aacaattagc cttgagtgct attaattaa 1719

<212> Type : DNA

<211> Length : 1719 

SequenceName : SEQ ID 513
SequenceDescription : 

Sequence



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgaaaaagt ttctaagaaa accccaattt tgactcttaa cattaggcgg ttttctcytct 60 

actagtgtta ttttagctgc ttgtgctacc ccttccaact cggctttaca aaccgttttt 120
aaagcgcgtt ccagtcagtt tttcaatggt gaacagggta gtttacaaag tgccctaaca 180

acggctttaa aaaatccagt ggccaacaaa caatttatcg ctgcaccact tttaaaagca 240
ctagaagctt ggtacgaaaa caacgaagac aaaaagatta cccagttttt aaaagacact 300

aagtccaatg ttgacagtca gtacacaacg gcagtcgata aagtggtatc agcatcacgc 360

aataaatcac tttttgtgca acaagatttg ttggataacg ccggtggtag tgaagcaacc 420

tggaaagcgc aaaagctgct tgaacagctc attagtgact ttgctagtcg ggttttccaa 480 
aagaactacc tcaattacaa aaaagatgga caagtttcta ctggtccatt tacttafcgat 540 
gaactacaca aggaagaaag ctgaaaaaac tttgaattta gfcgccccacg ttttagfccjaa 600 
actaatgatg actttttcgc caaaattcaa agccaagtat ttgaccaatg ggtggagtac 660 

actgatccca ctttaattag tcaagttaac tataagtatt ctgctcccag tcaagggtta 720
ggtcagatct ataacagaga gaagttaaag gataaactaa caccttctta tgcctttcct 780

ttctttgccg aagaaaaaga cattgcaccc aaccaaaacg ttggtaataa gcgctgaaag 840
cagttagtta aaggtgaggg cgctattact gataataata tcggtcaaag cggtacaaac 900

agccaaaaaa ctggtctgct caaataccgt aatgaatcaa ataagggtga ttttcttgat 960 

tttcccttaa atttatcaga tactaacgaa acgaagcaat tagtagacgc ttctaacatt 1020

gtcgatcagt tagaagctgc taacttaggt gcagctttaa atttaaaact gcaagtfctfct 1080

gagcaagata atgacgaatt gccgcaaatt aaggagctca aagaagacct taacaacacg 1140

attgttgtcg ataaaagtaa ggacgtagaa aaagcttcca aaactaacgc actgttttac 1200

aatgatcaag aaggtaagca acaacaaagt gactcagatc caattgctgg cgctttacjat 1260

gacatttttg ctcaaaacac aagcgaaggc actaacttaa gtaagttagc ggagcaggta 1320

aagaaagcag ctgcaacaaa aatggaagcg aaaacagctg ttttaagaac taacaattct 1380

aagggccaac aaaacaatta cgttgtttta gatgcagcta ttcctacatt taattcaaca 1440 

acatcaaaat cgaaaaataa tagcgcttct aatgaagttt tagttgcctt aaaatctgga 1500

tctataaatt taaggcaggt tcagcaaact gatcaaaaca gttacagtcc cattaaattc 1560

cggattgtgc gtaacagcac cggagtaact gttttcggcc ttgatggtgg gagctattat 1620

ttaaaacaag attcaacaaa taaaaaatca gtttctaagc aaagtttaac cttattaact 1680 

aaatctagtt caggtaacag taataaagta ttaagagacc ttgacaagca aaaacaattc 1740 

ttaaagtttc gtgcctttca agctaaaact aacactttct acagtactaa ctttgccttt 1800

agttttccct taaacgaaac gctaaaaagt tggtttgata aacaccggga actaattttg 1860

gccaacgcct tagttaacgc tagtcttgac caaaaggata aagctagcaa agctttaact 1920

gaagctttta atccttataa agagttaatt aaagaatttg cgccggtggc tttagcaacc 1980 

acaatgatca gcttttattt tgatcaaatg aaagcgctca ataacaagtt gctagagjcgc 2040

gcccgtaatc ttaaccaaaa cgtcaaccaa gccaatccaa ccccttggtt gaacggtttg 2100 

tcagccaagt taccttacgt taatactaac ggtaattacg aaaagttaaa caactacttt 2160 

accttcttaa tcacaaaaac attgtgacct aaagttggtc aagaagaaac aagcataagt 2220

gaagaaagca ataagctcaa aactaaaact gctgacgttg acaagattag ggacaaaatc 2280 

ttggaaaaca tccaaactaa agtaaatgat tttgttaaaa acaaactcaa acctgcttta 2340 

gcaccccgac cagcttacag taacgtcatt ttgttaaacg ttaacaatga caaagtacfct 2400

tctagtggtg ctaactggag cttagctagc ttattacaga gcgacaaagt caacccgctt 2460

tcgtttatgt tgctcaaaca agcgttcgat aataacgatt tatttaagaa agcacaaaag 2520

ttgtttaaag acattcagga aaagtccagt aataatggtg gaatgcaaag tagttccaca 2580

accaacagtg atgccgatgc attaagtaaa gttatcggta actattacta cactacttgg 2640 
gctaaactca cggacaaatc gatttacggt aacccgaagg acaacaagtt tgatgagctt 2700

ttcaaactgg cttttgaagc tagcatcgac gaaaagtcct ttaacgttga ttacaaagcg 2760

gtcattgacc attaccgttt tatctatacc ttacagtggc tagtggacca aaaattaaag 2820

aactttaagt cactgttaaa aacaaacctt aagtttggtg aagttgcttt tatagcttac 2880

aagaacactg aaaccactaa cttctctaat ccccaaggcg tatttggttc ctacttcaac 2940

tatgagaact ccgctagcga agttaaagaa tccactcaaa cactagatcc caataacttc 3000

ttttacaaaa caacaaccaa gccaacggta caagccattc aacaagttgc tagcttagca 3060

ttagtacaaa aacaacaaat gcaacaaaat tcaactgatc actatggttt tactggtttg 3120

agtaccagca ctagttcgat gttcgatgct agttcccgtg atgccatttt gcagcaaatt 3180

acgaaaacct ccttgcaaca gtacggttcc aaggatcaac tcaaaaagat cattcaaggg 3240

actaataacc aattgttact agaccggatt gcagtccagt taagtggatt aaatccttcc 3300

acaactaatg gtggcagtgg caaaacaatt gcgacctact tccaagtgga tgccgttggc 3360

aacccaactt tggacttcca agctaagcgt aaactgttat tagacctctt ggaccagtac 3420

caaaactact ttggtaatgg tgcccaaaag agccaaaggg Fittest-act ce aagtggaact 3480 

ggcaactacc tcacttacca aaatggtagt gacaagtaca cctacaccca gttcacttac 3540

caagacattg acagcttgag tctaacaact acaagcggta ccaacaataa aattgccagt 3600

gatgttgtgg cagcgctgct tttattccaa gcagcagaca agggcacaca acagttggca 3660 

cttagtgcca ttaataagcc gcagttaaat attggtgata aacggataga aagtgggtta 3720 

aaattgctca aatag 3735 

<212> Type : DNA

<211> Length : 3735 

SequenceName : SEQ ID 514 
SequenceDescription :

Sequence



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString :

atggttggaa gtggtgcggc tggaagtgct tctagtttac aaggcaatgg ctcgaacagt 60 

tcggggttaa aatcgctctt gagatcagca cctgtcagtg ttccaccaag cagtacaagt 120

aatcaaactt taagcttatc taaccccgct cctgtaggcc cacaagcggt tgtaagccaa 180

cccgcggggg gtgctacggc agcagtgtcc gtcaatcgca cagcgagtga caccgccacc 240

tttagcaagt acctcaacac cgcccaggcc ttgcaccaga tgggggtgat tgttccgggg 300

ttggaaaaat gaggtggtaa caacggtacg ggtgtagtgg ctagccgacg ggatgctact 360

tccactaacc tgccccatgc ggcaggtgct tcccaaacgg gtttgggaac tggtfccgccc 420

cgcgaaccag ctttaaccgc aacgtcacag cgtgccgtca cggtggttgc tggccccctt 480

cgtgcgggca atagcagtga aactgatgcc ctaccgaatg tcatcaccca gctctatcat 540 

acttcaaccg cccaactcgc ttacttaaat ggccagatcg ttgtgatgag ttccgcccgg 600

gtaccgagtc tttggtattg agttgtcggg gaggaccagg aatcgggcaa agcgacctga 660

tgagcgaaaa ccgagctcaa ctggggcacc gacaagcaga agcagtttgt cgaaaaccag 720

ttggggttta aagatgactc aaattcggat tccaaaaatt cgaatttgaa gacccaaggc 780

ctcacccaac ccgcctacct catcgccggt cttgacgttg tggccgacca cctcgtcttt 840

gcggcattta aagcgggcgc ggtggggtat gatatgacga ctgattcgaa cgctfccgacc 900 

tacaaccaag cactcgtctg gtcgaccacg gccgggttgg acagtgatgg ggggacaagg 960 
ctttggtag 969

<212> Type : DNA
<211> Length : 969

SequenceName : SEQ ID 515
SequenceDescription :

Sequence



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString :

atgccagttt ttctaaaatt aacgcacaca attagaaaag tgctaagagt tgcaagactt 60

tctagactag cattactatc actaaccgct gttattttta gcggttgtgc caatattaat 120

ttaattagtg ctgttggttc ttcttcggta caaccgttgt taagcaaact cagttcgcac 180

tatgtcttga accacaatga caaggataac cttgtagaaa ttagtgtcca agcgggtggc 240

tctagtgctg gggtgaaagc aatcaccaag ggactagctg acattggtaa tgtctcgaaa 300

aacaccaaga gctatgctga ggaaaacaag cagttgtgga tggacaaaaa gctcaaaaca 360

attacacttg gcaaagatgc cattgctgtt atttataaag caccatcaga gtttaagggc 420 

aaactagttc tcactaaaga caacctcaac gatctttacg atctgttcgc tggtagcaaa 480

agcgttgaca ttaataagtt tgtcgaaaac ggacaaacca ctaaaaacag taatcataat 540

ttgataggct tcccccgtac tgggggcgct tttgcttctg gtaccgctga agctttcttg 600

aagttttcgg gtcttacaca gacaaaaact ttagataaag attccaaaga aattttggaa 660

ggtcaacgca actatggccc caatgcgcga ccaactagtg aaaccaacat tgaggccttt 720

aatacctttg tcacaacttt gcgacaaccc aatttatacg gcatggtgta cctcagttta 780 
gggtttgtga ataacaacat gaacctaatt aagagtgaag gttttgaggt tttaaaagtc 840

aaatatgata ataacgcagt taccccctcc agtcaagcag tttctagcaa cacttacaaa 900

tgggtacgcc cgttgaactc agtggtttcc crtgttaccaa aacaaaaaac actgccaagt 960

atccaacgct tttttaactg attgttattt agcaacaaca gtgaaattaa gaaaatatac 1020

gatgactttg gtgtgttaga gttaacggct gacgaaaaga aaaagatgtt taaaacaggt 1080

aatgcagaaa tgagcaacat tgccaacttc tcjggttgatg attacagtct gaacaaccaa 1140 
accttcggtg cactctag 1158 
<212> Type : DNA 
<211> Length : 1158 
SequenceName : SEQ ID 516

SequenceDescription : 



Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgagcttcg cggtgctgcc cccggagatc aattcggcgc gcctgtacgt gggtgccggg 60
ttggcgccga tgctggacgc ggcggccgct fcgggatggac tggccgacga attgggttcg 120

gccgcggcct cgttttcggc ggtgacggcg g-cjgctggcag gttcctcgtg gctgggcgcg 180 

gcgtcgacgg cgatgacggg agcggccgcc occtatctgg gctggttgag cgcggcggcg 240
gcgcaggccc agcaggcggc cacccaaacc crcjgctggcgg cggccgcctt cgaggcagcc 300

ctggcggcga cggtacatcc ggcgatcatc tcggccaacc gggcactgtt cgtgtcgctg 360

gtggtctcga acctgctggg ccaaaacgcc crcggcgatcg cggccaccga ggccgcctac 420 
gagcagatgt gggcccagga cgtggcggcg atgtttggct accatgccgg ggcttcggcg 480

gccgtctcgg cgttgacacc gttcggccag cjcgctgccga ccgtggcggg cggcggtgcg 540
ctggtcagcg cggccgcggc tcaggtgacc acgcgggtct tccgcaacct gggcttggcg 600
aacgtcggcg agggcaacgt cggcaacggt aatgtcggga acttcaatct cggctcggcc 660
aacatcggca acggcaacat cggcagcggc aacatcggca gctccaacat cgggtttggc 720

aacgtgggtc ctgggttgac cgcagcgctg aacaacatcg gtttcggcaa caccggcagc 780 

aacaacatcg ggtttggcaa caccggcagc aacaacatcg gcttcggcaa taccggagac 840
ggcaaccgag gtatcgggct cacgggtagc gjgtttgttgg ggttcggcgg cctgaactcg 900 
ggcaccggca acatcggtct gttcaactcg ggcaccggaa acgtcggcat cggcaactcg 960 

ggtaccggga actggggcat tggcaactcg cycjcaacagct acaacaccgg ttttggcaac 1020

tccggcgacg ccaacacggg cttcttcaac tccggaatag ccaacaccgg cgtcggcaac 1080

gccggcaact acaacaccgg tagctacaac crcgggcaaca gcaataccgg cggcttcaac 1140

atgggccagt acaacacggg ctacctgaac agcggcaact acaacaccgg cttggcaaac 1200 

tccggcaatg tcaacaccgg cgccttcatt actggcaact tcaacaacgg cttcttgtgg 1260 

cgcggcgacc accaaggcct gattttcggg atcjccccggct tcttcaactc gaccagtgcg 1320 

ccgtcgtcgg gattcttcaa cagcggtgcc cjgtagcgcgt ccggcttcct gaactccggt 1380

gccaacaatt ctggcttctt caactcttcg tcgggggcca tcggtaactc cggcctggca 1440

aacgcgggcg tgctggtatc gggcgtgatc aactcgggca acaccgtatc gggtttgttc 1500 

aacatgagcc tggtggccat cacaacgccg gccttgatct cgggcttctt caacaccgga 1560

agcaacatgt cgggattttt cggtggccca crcggtcttca atctcggcct ggcaaaccgg 1620

ggcgtcgtga acattctcgg caacgccaac Ettcggcaatt acaacattct cggcagcgga 1680

aacgtcggtg acttcaacat ccttggcagc cygcaacctcg gcagccaaaa catcttgggc 1740

agcggcaacg tcggcagctt caatatcggc agtggaaaca tcggagtatt caatgtcggt 1800

tccggaagcc tgggaaacta caacatcgga tccggaaacc tcgggatcta caacatcggt 1860 

tttggaaacg tcggcgacta caacgtcggc ttcgggaacg cgggcgactt caaccaaggc 1920

tttgccaaca ccggcaacaa caacatcggg ttcgccaaca ccggcaacaa caacatcggc 1980

atcgggctgt ccggcgacaa ccagcagggc fctcaatattg ctagcggctg gaactcgggc 2040

accggcaaca gcggcctgtt caattcgggc accaataacg ttggcatctt caacgcgggc 2100 

accggaaacg tcggcatcgc aaactcgggc accgggaact ggggtatcgg gaacccgggt 2160

accgacaata ccggcatcct caatgctggc agctacaaca cgggcatcct caacgccggc 2220

gacttcaaca cgggcttcta caacacgggc agctacaaca ccggcggctt caacgtcggt 2280

aacaccaaca ccggcaactt caacgtgggt gacaccaata ccggcagcta taacccgggt 2340

gacaccaaca ccggcttctt caatcccggc aacgtcaata ccggcgcttt cgacacgggc 2400 

gacttcaaca atggcttctt ggtggcgggc cyataaccagg gccagattgc catcgatctc 2460 

tcggtcacca ctccattcat ccccataaac gagcagatgg tcattgacgt acacaacgta 2520

atgaccttcg gcggcaacat gatcacggtc stccgaggcct cgaccgtttt cccccaaacc 2580

ttctatctga gcggtttgtt cttcttcggc ccggtcaatc tcagcgcatc cacgctgacc 2640

gttccgacga tcaccctcac catcggcgga <=cgacggtga ccgtccccat cagcattgtc 2700 

ggtgctctgg agagccgcac gattaccttc cztcaagatcg atccggcgcc gggcatcgga 2760 

aattcgacca ccaacccctc gtccggcttc ttcaactcgg gcaccggtgg cacatctggc 2820 

ttccaaaacg tcggcggcgg cagttcaggc gjtctggaaca gtggtttgag cagcgcgata 2880 

gggaattcgg gtttccagaa cctcggctcg ctgcagtcag gctgggcgaa cctgggcaac 2940

tccgtatcgg gctttttcaa caccagtacg cjtgaacctct ccacgccggc caatgtctcg 3000

ggcctgaaca acatcggcac caacctgtcc cjgcgtgttcc gcggtccgac cgggacgatc 3060
ttcaacgcgg gccttgccaa cctgggccag ttgaacatcg gcagcgcaaa tctcggcgac 3120
ttcaacctgg gcagcggaaa cgtcggcagc ttcaacgtct tctccggaaa ccagggctca 3180
tacaatatcg gtccggcgaa cctgggtaac tacaacatcg gtttcgcgaa cctgggtaac 3240
tacaacatcg gcttcggcaa cgccggcgat ttcaaccaag gctttgccaa caccggcaac 3300
aacaacatcg gatttgccaa caccggcaac aacaacatcg gcatcgggct gtccggcgac 3360
aaccagcagg gcttcaattt tgctggcggc tggaactcag gcaccgcgaa catcggcttg 3420
ttcaactccg gcaccaacaa cgtcggcatc ggcaactcgg gcaccggcaa ctggggtatc 3480
ggcaactccg gcagcggcaa caccggcatc ggcaacaccg gcagcactaa cacgggcttc 3540
ttcaacaccg gcatcgtcaa caccggtgtc gccaacgcgg gcagctacaa caccggctgg 3600
tacaacaccg gcgacaccaa caccggcatc gccaacctgg gcgacttcaa cacgggcttc 3660
tacaacaccg gcaatttcag tacgggcttt gccaaccagg gtgatatcgc caccggggct 3720
ttcatcaccg gcgacatggg caacggcgcc ttctggcgcg gcgaccagca gggcctattc 3780
agcgcgggct atcgggtcca tgttcccgaa atacccgcac acgtcaccgt ggaagttccc 3840
gtcaacatcc ccatcaccgc cagcttcacc aacaccgtct acagcggcat aacgcttgag 3900
caaatcaact tcggtttcac catcgacatc gcagggatcc ccctgctggc cggtgcaatc 3960
agcaaggccg ttctcccgcc catcaccggg accggtcccg cgatcacggt caacatcggc 4020
gaccctggcg gttcgaccgc gatcaggatc ccggccaccg caagcgtcgg tcccttcgat 4080
gtcacgttcg tcaacattgc ggctaccacg ggctttttca acgccaccac cgatccgtcc 4140
tcgggcttct tcaacggcgg ccccggaacc gtatcgggca tcgccaacat cggcgccaac 4200
atttccggct tccagaacgt cgcgaactcc gcgacctcgg gcttcaacaa ctacggctcg 4260
ctgcaatcgg gactggcgaa cctgggcgat accgtctcgg gcgtattcaa caccggcatc 4320
ggggcaccgg ccaacgtctc gggcatgttc aacatcggca gcaacctcgc ggggttcttc 4380
cacgaccagg cgaccgggat gtcgatgttc aacctcggcc tgggaaacat cggccaattc 4440
aacgtcggct tctccaacgt aggcgacagc aacgccggct tggcgaacat cggcagcttc 4500
aacctcggca gcggcaacct cggcagcttc aacgtcttcg gcggaaacca gggctcatac 4560
aacatcggcc cggcgaacct gggtaactac aacatcggcc taggcaacct gggcagctac 4620
aacttcggat tcggcaacgc cggcgacttc aacctgggct tcgccaacac cggcaacaac 4680
aacatcgggt ttgccaacac cggcaacaac aacatcggta tcggcctgtc cggcgacaac 4740
cagcagggct tcaattttgc cggtggctgg aactccggca gcggcaacag cggcctgttc 4800
aactctggca ccaacaacat cggtttgttc aactcgggca ccggcaacat cggcatcggc 4860
aactcgggca ccggaaactg gggcatcgcc aacaccggtg acaccaacac cggcatcttc 4920
aacaccggcg acgtcaacac cggcttgctc aacgccggca acgtcaacac cggcatcttc 4980
aacaccggcc attacaacac cggcagcttc aacgccggca gcttcaacac ggccggcttc 5040
aacccgggta gctacaacac gggttatttg aacaccggca gctacaacac cggactggcc 5100
aactcgggcg atgtcaacac cggcggcttc atcaccggca attacagcaa cgggttctgg 5160
tggcggggcg attaccaagg cctggcgggg atcagccaaa cgatcaccgt gcccgacacc 5220
gccgttccgg tgaaactgca cgtgccgatc ttcctcgata tcccggtcac cggcacactt 5280
ggcacgttca ccgttcatgg cttcagattc ccggagatca ccggcgatat cttcttgatc 5340
ggcataccgt tcaatgccgc cacactcgat gcattcagtt tcccgaacat ctcgattgtc 5400
cttcccaata tcggcatcaa cctgggtagc gggccggacc cgctgatcga tatcgccggc 5460
accggcggtc tattgccgat caagattcca ctcatcgata taccggcggc cccgggattc 5520
gggaactcga cgaccacccc gtcgtcggga ttcttcaacg ccggtaccgg taccgtgtcg 5580
ggcgttggca acgtgggcag caatagttcc ggcttcttca acctcacctc tggaagctcg 5640
ggaatctcgg gcgtccagaa cttcggcgag ctgatctccg gcgggttcaa cttcggtaac 5700
actgtctccg gccttgtcaa tgcgagcacg cttgggcttt cgatgccggc caatctctcc 5760
ggcggaggga atgtcggtgc tacggtcgcc ggcttcgtca acaacaccca gatcctcaac 5820
ctcgggtttg gcaacgtagg cagcgggaat gtcggccacg gcaatatcgg cgactccaac 5880
gtcggcctcg gaaacctcgg caacgcgaac gtcggccatg gcaacatcgg cagcttcaac 5940
gtcttctccg gaaaccgggg ctcatacaat atcggcccgg cgaacctggg taactacaac 6000
atcggcctag gcaacctggg cagctacaac ttcggattcg gcaacgccgg cgatttcaac 6060
ctgggcttcg ccaacagcgg cagcaacaac atcgggtttg ccaacaccgg caacaacaac 6120
atcggtatcg ggctgtccgg ccacaatcaa caggggttcg gctcctggaa ctccggtacc 6180
gccaacaccg gcctgttcaa ctcgggcacc aacaacatcg gtttgttcaa ctcgggcacc 6240
ggaaacatcg gcattggcaa ctcgggcatc ggaaacaccg ggatcggcaa cccgggtgtc 6300
ggcaacaccg gcttggggaa ctcgggcacc ggcaactggg gcctgtggaa cccaggcacc 6360
ggcaatatgg gcgtcgccaa cgtgggcacc tacaacaccg gtggctacaa cgtgggcagc 6420
accaacaccg gcatcgccaa cgtgggcatc gccaacacgg gcagctacaa caccggcagc 6480
accaacaccg gcagcttcaa cgacggcgac ttcaataccg gcttctacaa caccggcgac 6540
tacaacaccg gcttctacaa caccggcgac gtgaacaccg gcgccttcat cgggggcaac 6600
ttcagcaacg gcgccttctg gcagagcgat caccaaggcc agtggggcgc acactacgca 6660
atcactgttc cccagatccc gctactgaac tttagcctca acattccggt caacatcccc 6720
atccatctcg acttcggtac ccttgccgtc aacggcttcc agattccggc tatcaccctc 6780
cgcgccctcg gggtcaccca cttcagcgtc ggacccatca tcgttccgag gatcgccggc 6840
accttaccgg tgatcgatat caacatcggc gaccccggcg gttcatcctc gatacccatc 6900
acgatcacca gcggcgccgg cccggtcgtc atcccgctac tggacatccc gccggccccc 6960
ggtttcggaa actcgaccac cggcccctca tcgggcttct tcaactccgg caccggcagc 7020
tcgtctggat tcggaaacgt cggcgccaac aattcgggct tctggaacac cgctttcgcc 7080
ggcataggaa actctggctt gcagaacttc ggctcgctgc aatccggctg ggcgaacttg 7140

ggcaacaccg tctcgggctt ctacaacacc agtgcggcgg acttcgcgac gccggctaac 7200 

ctctcgggac tctccaacgt cggcgccgac ctgaccggcg tgctccgcgg cccgaacggg 7260

tcgaccttca acgcgggcct ggcaaacctc ggccaattca acgtgggcag tgcaaacctc 7320

ggcagtgcaa acctcggcag tgcaaacctc ggcagtgcaa acctcggcaa ttcaaacgtc 7380

ggcttcggca acattggcaa cgcgaacatc ggcggcgcaa acatcggcga ctttaacgtc 7440 

gggatcgcaa acaccggtcc ggggctaacg gcggctgtca acaacatcgg tatcggcaac 7500 

accggcaact acaacatcgg tgtcggcaac accggtaact acaacatcgg cttcggcaac 7560 

accggcaaca acaacatcgg catcggcctg tccggcgaca accagatcgg gttcggcccg 7620

ctgaacgccg gcatcgccaa catgggcctg ttcaacctgg gcgacaacaa ctttggcatg 7680

gccaacgcgg gcaacttcaa ccagggcatt gccaacaccg gcaacaacaa catcggcttg 7740

ttcaacaccg gcaacaacaa cgtcggcatc tggctgaccg gcgacggctt gtccggcttc 7800

agctccctga actccggcgc cggcaacacc ggtttcttca actccggcac cgccaacacc 7860 

ggcttgttca actccggcac cggcaacacc ggcttgttca actcgggcac cggcaacgtc 7920

ggcatcggca acatgggcac cggcggcttc ggcgtcggcc tatccggcga cagccaggtg 7980

ggcatcggcg gcaccaactc gggcagtttc aacatcggct tgtttaactc gggcaccggc 8040 

aatgtcggca tcggcaactc gggcaccggc aacgtcggca tcggcaacac cggcaccggc 8100

aacaccggca tcggaaacag cggcaactac aacaccggct tgctcaacgc gggcctggtc 8160

aacaccggca tcgccaaccc gggcaaccac aacaccggcc tgttcaacat cggcaccttc 8220

aacaccggca tcgccaaccc gggccactac aacaccggct cctacaacac cggtagctac 8280

aacaccggca tggcaaacgc cggagactac ggcaccggcg cgttcatcac cggcagcatg 8340

aacaacggct tgctctggcg cgccgaccgg cagggcctgc tggcggccaa ctacaccatc 8400

accatcgagc gacctgccgc gttcctcaat gtcgacatcc cggtcaacat ccccatcacc 8460 

ggcgacatca ccaatgtctc catccccgcc attacgttcc ccagaatcga cgccagcgga 8520 

agcgtcgaca taggcatcct cagtggcacc gtcttggccc cggtcggtcc gatcaccctg 8580

catggcgggg acgcgtcggc cccgctggac acacccatcg aaattgactt cggcccctcg 8640

ccggcgatca acctcaacat cggcaagccc gacggctcca ccgtgatcaa catcgtgggc 8700

ggcgccggcg ccggcccgat cagcattccg atcatcgact tgcggccagc gcccggcttc 8760

ttcaacgcca ccaccggccc gtcgtcgggc ttcctcaact ggggtgctgg cagcgcatcg 8820

ggcttgctga acttcggcaa caactcgggc ctctacaact tcgccactag cagcatggga 8880

aattcgggct tccaaaacta tgggtcgctg cagtcgggct gggcgaattt gggcaacagc 8940 

atctcgggca tctacaacac cggcttggga gcaccggcaa atgtctcggg cttgctcaac 9000

atcggcacca acctggctgg gtggttgcag aacggcccga ccgagacgac cttcagcgtg 9060

ggcttggcca acctcgggtt ctggaatctg ggtagcgcaa acatcggcaa ctacaacctg 9120

ggcagcgcca acatcggcgt ctacaacctg ggcagcgcca acatcggcga cttcaacctg 9180

ggcagcgcca acatcggcga cttcaacctg ggcagcgcca acatcggcag ctccaacatc 9240

gggttcggca acgtcggtcc ggggctgacg gcggccatcg gcaacatcgg cttcggcaac 9300

accggaaacg gaaacatcgg catcggcaat accggcaccg gcaacatcgg cttcggcaac 9360

accggaaacg gaaacatcgg catcgggctg accggcgaca ccatgaccgg gttcggcggc 9420

tggaactcgg gcaccggcaa catcgggcta ttcaactccg gcaccggcaa catcggcttc 9480

ggcaactccg gcaccggcaa ctggggcatc ggaaactccg gtgactacaa caccggcatc 9540 

ggcaacaccg gcagcaccaa ctccggcttc ttcaacaccg gcctggtcaa caccggcatt 9600

ggcaactccg gtgactacaa caccggccta ttcaacgccg ggaacaccaa caccggcagc 9660

ttcaaccccg gcgactacaa caccggcggc ttcaaccccg gtaactacaa caccggctac 9720

ttcaacccgg gcaactccaa caccggcatc gccaactccg gcgatgtcaa caccggcgcc 9780

ttcaattcgg gcaactacag caacggcttc ttctggcggg gcgactacca gggcctaggc 9840

ggtttcgcct accagagcgc cgtttccgaa atcccgtgga gctacgacag gttccaacat 9900

tga 9903 
<212> Type : DNA 
<211> Length : 9903 

SequenceName : SEQ ID 517 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

atgaacctgg tctccacaac gtcgggaatg tcgggcttcc tcaacgtcgg cgcgctggga 60 

tcgggtgtgg cgaatgtggg caacaccatc tcgggtatct acaacgtggg cacgtcggac 120

ctctcgacgc ccgccgttaa ctccgggttg gcaaatatcg gaaccaatat tgccggcctg 180

ctgcgcgacg gcgcgggtac tgcggctatt aacttgggct tggccaacca cggcaacctc 240

aacgtgggct tcgcaagtct cggcggcttt aacttcggcg gcgccaccat cggccacaac 300

aacgtcggca tcgggaacac cggaatcttc gatgtcggcc tggcgaacct gggcagctac 360

aacatcggct tcggaaacct tggcgacgac aacctgggct tcggcaactt cggcagctac 420

aacatcggct tcggcaacgt cggcaacgac aatctgggtt tcgctaacgc gggcggcggc 480

aacatcggct ttgcgaacac cggcagcaac aatgtcggct ttgggaacac gggcagcaac 540

aatgtcggca tcgggctcac gggcaacgga cagatcgggt tcggcagctt caactcgggc 600

agcggaaaca tcggcctgtt caactcgggc agcaacaaca tcggattctt caattccggc 660

agcggcaact tcggcatcgc aaactcgggc agcttcaaca ctggcatcgg aaacaccggc 720

aacaccaata ccggcctatt caactccggc gacgtcaaca cgggcgcctt caacccgggc 780

agcttcaaca ccggtagctt caacaccggc agcttcaaca ccggtggctt caatccgggc 840

aataccaaca ccggctacct caacattggc aactacaaca ccggcatcgc caacaccggc 900

gacgttgaca ccggggcttt catcaccgga aactacagca acgggttgtt cttaagcggc 960

gattaccagg gcctggtcgg cctcaacctg gtgatcgata tgcctctccc cataagcctc 1020

ggcgtgaata ttcccatcga tatcccgatc accgcctcgg ccggcaacat cacccttatg 1080

ggcgtcacga ttccgcccac cggcgatatc gtcctttcgt caatagcggg ccagcgagcc 1140

cactttggcc ccattaccat tccgaacatc acggttgtcg gccccacgac gacagtcgcc 1200

ataggagggc cgaataccgc gatcaccata actggcggtg gcgccattag gatcccgctc 1260

atcagtatcc cggcggcgcc aggtttcgga aactcgacca ccaacccgtc gtcaggtttc 1320

ttcaataccg gcgccggcgg cgcctcgggc ttcggcaact tcggcggcgc caattcgggc 1380

ttttggaacc f tggcctqcgc gacctcgggg gcgtcggggq tccfccaacgt .cggcgccctg 1440

ggatcaggtc tggcgaacgt gggcaccacc gtctcggggt tctacaacac cagcacgtcg 1500

gacctcgcga cgccggcctt caattcaggc ctggccaaca tcagcaccag tatcgccggc 1560

ctgctgcgcg acagcacggg caccatggtc ctcaacctgg gcttggcaaa ccacggcacc 1620

ctcaacgtcg gcattgcaaa cctcggcgac tacaacatcg gctttgcaaa cctcggcagc 1680

gccaacttcg gcagcgccaa tatcggtggc aacaacatcg gcggcgcaaa caccggaata 1740

ttcgacatcg gtttggcaaa tctgggcagt tacaacatcg gcttcggaaa cttcggcgat 1800

gacaacctgg gcttcggaaa cctcggcagc tacaacgtcg gcttcggaaa cttgggcaac 1860

gacaacctgg gcttcgccaa caccggcagc aacaatatcg ggttcgcgaa caccggcagc 1920

aacaatatcg gcattgggct cacgggcgac ggccagatcg ggttcggctc cctgaattct 1980

ggcagcggaa acatcggctt gttcaactcg ggcagcggaa acatcggctt tttcaactcg 2040

ggcaacggaa acgttggcat cggcaacacc ggcaccgcaa acttcgggct tggaaacacc 2100

ggcagcacca acaccggctt cttcaactcc ggcgacgtca ataccggtat cggcaacacc 2160

ggcagcttca acaccggcag cttcaatccg ggcgattcca acaccgggga tttcaaccca 2220

ggcagctaca acacgggact cggaaacacc ggcgatgttg acaccggcgc cttcatctcc 2280

ggcagctaca gcaacgggtt cttgtggagt ggaaattatc agggcctcat tggcttgcac 2340

gcggcgctag cgattcccga aatcgcccta acctttggcg tcgacatccc gatacatata 2400

cccatcaaca tcgacgccgg ggtcgtcacc ctccagggct tcagcatcgt agctgccgaa 2460

aataatatcg acttcacccc catcatcatc ccgaccatca atatcacctt gcccacggcg 2520

gcgatcaccg tgggcggacc caccacctcg atcggtatca ccgccagcgc cggtatcggc 2580

tccatcacca tcccgatcat cgacattccc gcgacatcgg gcttcggcaa ctcgaccact 2640

agtccgtcgt cgggcttctt caactccgga gcgggcagcg cgtcgggctt tttgaacgtg 2700

gtcgccggcg cctcagggat ttcgggttat ctcaatgtcg gtgcgctggg gtcgggtgtg 2760

actaacgtgg gtcacaccgt ctcgggtttc tacaacgcga gcgcgttgga cctcgtgacg 2820

ccggcctttg cctccggtct catgcgcgac ggtatgggca cgatgactct gaaccttggg 2880

ctggcaaacc tgggcagcaa taacgccggc ttcggcaaca ccgggatctt tgacgtcggc 2940

gtggcgaatc tgggcaacta caacatcggc ttcggaaact tcggcgacga caacctgggc 3000

tttgccaacc taggcagcta caacatcggc gttgccaaca ccggcagcaa caatatcggc 3060

tttgccaaca ccggcagcaa caatatcggc atcgggctca ccggtaccgg ccaaatcggg 3120

atcggcgctc tgaactcggg cagcggaaac atcggcttgt tcaactcggg cgacggaaac 3180

atcggcttct ttaactcggg caccgggaac ttcggcatcg gcaacaccgg caccggaaac 3240

ttcggcatcg gcaactcggg cagcaccagc acgggcttgt tcaactcggg cgacggcaac 3300

accggcggct tcaaccccgg taacttcaac accggcaatt tcaataccgg cagcttcaac 3360

accggcggcfc tcaacgcggg taacaccaac accggccact tcaacaccgg gaactacaac 3420

accggcatcg cgaatacggg cgacgtcagc accggcgctt tcatctccgg caactacagc 3480

aacggcatct tgtggcgggg cgactaccag ggcctgatcg gttactccta cgcgctgact 3540

attccggaga ttccggcgca cttggacgtc aatatcccaa tcgacatacc gatcaccggc 3600

agtttcaccg acctcgtggt ggacaatttc actatcccca tcatcggctt cgaatccttc 3660

gcgtttagct ttcacatcca taccgagccg gacatcggtc ccatcattgt cccgagcttc 3720

gtgctcagcg ttcccacgtt cgcgatcgcc gtgggcggac ccacgaccgc gatcaacatc 3780

agcgccaccg ccggcctcgg ccccatcacc atcccgatca tcgacattcc ggcagcgccg 3840

ggcatcggaa actcgaccac cagcccgtcg tcaggcttct tcaacaccgg cgccggcacc 3900

gcatccgggt tcggcaacgt cggcggcaac acatcgggcc tgtggaacct tgcgtcggca 3960

gcctcaggag tctcgggctt gctcaacgtc ggcgcgttgg gatcgggtgt ggcgaatgtg 4020

ggcaacacca tctcgggtat ctacaacacg agcccgctgg acctcgggac gccggccttc 4080

ggctccggcc tcgcaaacat cgccggcctg ctgcagggcg gcgccggcac gacgatcctc 4140

gacttggccg gcctcggcaa cctcaatgtc ggcttggcaa acctcggggg ctctaacttc 4200

gggatcggga acaccggaat cttcaatgtc ggtttcgcaa acgtgggcaa ccacaacatt 4260

ggcttggcaa acctgggcaa ctacagcgtc ggcttcgcca actcgggcaa ctaccatatc 4320

ggcattgcta acaccggcag tgccaatatc ggcttcgcca acaccggtag cggcaatatc 4380

ggcatcgggc tcaccggcac cggtcagatc gggttcggca gcttcaactc gggcagccac 4440

aacatcggct tgttcaactc cggtgacgga aacgtaggat tcttcaactc gggcaccggc 4500

aacgtgggca tcggaaacac cggcaccgca aacttcggca tcgcaaactc gggcagcttc 4560

aacaccggcc tcgggaacac gggcagcacc aacacgggcc tgttcaaccc gggcaacgtc 4620
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aacaccggcg tcggcaacac cggcagcatc aacaccggca gcatcaacac cggcagcttc 468 0 

aacactggca gcaccaatac cggcagcttc aacctcggcg atcacaacac cggcagcttc 4740 

aactccggtg actacaacac gggctacttc aacgcgggtg actacaacac gggtgtggcc 48 00 

aacacgggca acgtcaacac cggcgcgttc atctccggca attacagcaa cgggttcttc 48 60 

5 tggcgaggtg actaccaggg gttgattggc ctttccacaa cgatcaccat tcccgaaatc 492 0 

ccctaccgct acgacttgag tgttccaatc gacataccca tcaccggcac cgtcgtcgcc 4980 

accacgccaa acagtttcac cattcccggt ttccagatac gagtcttgct tggtcctgcg 5040 

gcggtgcttg tcaacgagat gatcggcccc atcacgatcg atgtcaatca agtcatcgcc 5100 

atcgattcgc ccattcagca aaccatcagc atggttggca ccggcggctt cggcccgatc 5160 

10 cccatcggca tcagcatcgg tggtaccccg ggtttcggca actcgaccac cggcccgtcg 5220 

tcgggtttct tccacaccgg cgccggccat gtatcgggct tcgggaactt cggcgccggc 52 80 

aacatgtcgg gctccgggaa cttcggcgct ggcaattcgg gcttctttaa cgccggcggc 53 40 

ttgggcaatt cgggcctact gaatttcggc gcgctgcagt cgggtctggc gaacctgggc 54 00 

aacaccatct cgggcgtcta caacacgagc acgcfcggacc ^fccigcgacycc cgccttcggc 54.6Q 

15 tcgggcatcg caaacatcgg cgccaacctg gccggcctgt tcctcgacaa caccggcaac 552 0 

ctgacgctga acttcggcgt cgcaaaccag ggcggcctca acgcgggcat cgggaacctg 5580 

ggcagcgtca acatcggctt cgttaatacc ggcgactcca acctgggcat cggcaacctc 564 0 

ggcgacctca acttcggcgg ggtcaacatc ggcggtaaca acatcggcat cgccaacacc 5700 

gggatcttcg atatcggctt ggcgaacctg ggcagctaca acatcgggtt ggcaaatctg 5760 

20 ggcgacgaca acctgggctt tggcaacgcc ggcagctaca acatcggctt cgcgaacttc 5820 

ggcagcgaca acctgggctt tgccaacacc ggcagctaca acatcggctt cgcgaatacc 58 8 0 

ggtaacaaca acatcggcgt cgggctcacc ggcaacggcc agatcgggat cggcagcctc 5940 

aactcgggca gcaacaacat cgggctgttc aactccggca gcggaaacat cgggttcttc 60 0 0 

aactcgggca ccggcaacgt cggcatcttt aacaccggca ccggcaactt cggtctcgcg 6060 

25 aactcgggcg gcttcaacac cggcatcggc aacgcgggca gcaccaacac gggcgtgttc 612 0 

aaccccgggg acctcaacac cggcagcttc aacccgggca gcttcaacac cggcggcttc 618 0 

aacccgggca gtggcaacac gggctacctc aacaccggtg actacaacac gggcgtggcg 6240 

aacacgggcg atgtggacac cggtgcgttc attaccggca gctacagcaa cggcttcttg 63 0 0 

gtgagtggcg actatcaggg cctgatcggc ctgccgctgt tgggcattcc ggtgaccccc 63 60 

30 ggctacttca acctcactgg cggcccgtcg tcgggcttct tcaacagcgg cgccggaagc 642 0 

gtatcgggat tcgtgaactc cggtgccggc ctgtcgggct acctcaatac cggcgcgctg 6480 

ggatcgggtg tcgccaacgt gggcaacacc atctcgggct ggttgaacgc cagcgcgctg 6540 

gatctcgcga cgccggggtt cctttccggc atcggtaact ttggcaccaa cctggcgggt 660 0 

ttctttaggg gataa 6615 

<212> Type : DNA
<211> Length : 6615

SequenceName : SEQ ID 518

SequenceDescription :

Sequence



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgtcgttcg tgttgatcgc accggaattc gtgacagcag ccgcggggga tctgacgaat 60 

45 ctgggttcgt cgattagcgc ggccaacgcg tcggcagcca gtgcgaccac gcaggtgctg 12 0 

gctgcgggcg ccgatgaggt gtctgcccgt attgcggcgc tgttcggcgg gtttggcctg 180 

gagtaccagg cgattagtgc gcaggtggcg gcctaccacc agcggtttgt gcaggccttg 240 

agtaccggcg cgggcgcata tgcctcggcc gaggccgccg ccgctgagca gatcgtgctg 3 00 

ggcgtgatca atgcgcccac ccaggcgctg ctggggcgcc cgttgatcgg tgacggcgcc 3 60 

50 aatgcgacga ctcccggcgg ggccggcggg gccggcggtc tgctgttcgg caacggcggg 420 

gccggggcag ccggggcgcc cggccaggcc ggcgggcctg gcgggcccgc cggattgtgg 480 

ggcaacggcg ggcccggcgg ggccggcggc agcggtgggg gcaccggcgg tgccggcggc 540 

gccggtgggt ggctgttcgg ggttggcggc gccggcggtg tcggtggggc cggtggcggc 600 

accggcgggg cgggcgggcc cggtggtttg atctggggcg gcggcggggc cggcggtgtc 660 

55 ggtggggccg gtggcggcac cggcggggcc ggcggccgcg ccgagctgct gttcggcgcc 720 

ggcggtgcgg gtggggcggg caccgacggc gggcccggtg ctaccggcgg gaccggcgga 780 

cacggcggag tcggcggcga cggcggatgg ctggcacccg gcggggccgg cggggccggc 840 

gggcaaggcg gggcaggtgg tgccggcagc gatggtggcg cgttgggtgg taccggcggg 900 

acgggcggta ccggcggcgc cggtggcgcc ggcggtcgcg gcgcactgct gctgggcgct 960 

60 ggcggacagg gcggcctcgg cggcgccggc ggacaaggcg gcaccggcgg ggccggcgga 1020 

gatggcgttc tggggggtgt cggtggcact ggtggtaagg gcggtgtcgg cggcgtggct 1080 

ggcctcggcg gggccggtgg tgccgcgggc cagctcttca gcgccggagg cgcggcgggt 1140 

gccgttgggg ttggcggcac cggcggccag ggtggggctg gcggtgccgg agcggccggc 1200 

gccgacgccc ccgccagcac aggtctaacc ggtggtaccg ggttcgctgg cggggccggc 1260 

65 ggcgtcggcg gccagggcgg caacgccatt gccggcggca tcaacggctc cggtggtgcc 1320 

ggcggcaccg gcggccaagg cggcgccggc ggcatgggtg gctccggtgc tgataatgcc 13 80 

agcgggattg gcgccgacgg cggcgcgggt gggactggcg gtaacgccgg cgccggcggg 1440 
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gccggcgggg ccgccggcac cggaggaacc ggcggggttg tcggcgccgc gggcaaggcc 150 0 

ggtatcggcg gcaccggcgg ccaaggcggc gccggcggcg cgggcagcgc cggcacggat 1560 

gcgaccgcta ccggtgccac cggcggcacc gggttttccg gtggagccgg cggggccggc 1620 

ggggccggcg gcaacaccgg ggttggcggc accaacggct ccggcgggca aggcggcacc 1680 

5 ggcggcgcgg gcggcgccgg tggtgctggc ggtgtcggcg ccgacaaccc caccggcatc 174 0 

ggcggcaccg gcggcaccgg cgggaaaggc ggcgccggcg gggccggcgg gcagggcggt 180 0 

agcagcggtg ccggcggcac caacggctct ggtggcgctg gcggcaccgg cggacaaggc 1860 

ggcgccgggg gcgctggcgg ggccggcgcc gataacccca ccggcatcgg cggcgccggc 192 0 

ggcaccggcg gcaccggcgg agcggccgga gccggcgggg ccggtggcgc catcggtacc 19 80 

10 ggcggcaccg gcggcgcggt gggcagcgtc ggtaacgccg ggatcggcgg taccggcggt 2040 

acgggtggtg tcggtggtgc tggtggtgca ggtgcggctg cggccgctgg cagcagcgct 2100 

accggtggcg ccgggttcgc cggcggcgcc ggcggagaag gcggagcggg cggcaacagc 2160 

ggtgtgggcg gcaccaacgg ctccggcggc gccggcggtg caggcggcaa gggcggcacc 222 0 

ggaggtgccg gcgggtccgg cgcggacaac cccaccggtg ctggtttcgc cggtggcgcc 2280

15 ggcggcacag gtggcgcggc cggcgccggc ggggccggcg gggcgaccgg taccggcggc 2340 

accggcggcg ttgtcggcgc caccggtagt gcaggcatcg gcggggccgg cggccgcggc 2400 

ggtgacggcg gcgatggggc cagcggtctc ggcctgggcc tctccggctt tgacggcggc 2460 

caaggcggcc aaggcggggc cggcggcagc gccggcgccg gcggcatcaa cggggccggc 252 0 

ggggccggcg gcaacggcgg cgacggcggg gacggcgcaa ccggtgccgc aggtctcggc 2 580 

20 gacaacggcg gggtcggcgg tgacggtggg gccggtggcg ccgccggcaa cggcggcaac 2640 

gcgggcgtcg gcctgacagc caaggccggc gacggcggcg ccgcgggcaa tggcggcaac 2700 

gggggcgccg gcggtgctgg cggggccggc gacaacaatt tcaacggcgg ccagggtggt 27 60 

gccggcggcc aaggcggcca aggcggcctg ggcggggcaa gcaccacctc gatcaacgcc 2820 

aacggcggcg ccggcggcaa cggcggcacc ggcggcaaag gcggcgccgg tggtgcggga 2880 

25 accctgggcg tcggcggctc cggcggcacc ggcggggacg gcggcgatgc gggctctggt 2940 

ggtggcggcg gcttcggcgg ggccgcgggt aaggccggcg gcggcggaaa cggcggccgc 3 000 

ggcggtgacg gcggcgatgg ggccagcggt ctcggcctgg gcctctccgg ctttgacggc 3 060 

ggccaaggcg gccaaggcgg ggccggcggc agcgccggcg ccggcggcat caacggggcc 3120 

ggcggggccg gcggcaacgg cggcgacggc ggggacggcg caaccggtgc cgcaggtctc 3180 

30 ggcgacaacg gcggggtcgg cggtgacggt ggggccggtg gcgccgccgg caacggcggc 3240 

aacgcgggcg tcggcctgac agccaaggcc ggcgacggcg gcgccgcggg caatggcggc 33 00 

aacgggggcg ccggcggtgc tggcggggcc ggcgacaaca atttcaacgg cggccagggt 33 60 

ggtgccggcg gccaaggcgg ccaaggcggc ctgggcgggg caagcaccac ctcgatcaac 3420 

gccaacggcg gcgccggcgg caacggcggc accggcggca aaggcggcgc cggtggtgcg 3480 

35 ggaaccctgg gcgtcggcgg ctccggcggc accggcgggg acggcggcga tgcgggctct 3540 

ggtggtggcg gcggcttcgg cggggccgcg ggtaaggccg gcggcggcgg aaacggcggt 3600 

gttggcggtg acggcggcga gggagccagc ggtctcggcc tgggcctctc cggctttgac 3660 

ggcggccaag gcggccaagg cggggccggc ggcagcgccg gcgccggcgg catcaacggg 3720 

gccggcgggg ccggcggcac cggcggggcc ggtggtgacg gcgccccggc gaccctgatc 3780 

40 ggcggacccg acggcggtga cggcggccaa ggcggcatcg gcggggacgg cggcaacgcc 3840 

ggattcggcg ccggtgttcc cggcgacggc ggggacggcg gcaacgccgg attcggcgcc 390 0 

ggtgttcccg gcgacggcgg gatcggcggc accggcgggg ccgggggcgc cggcggcgcc 3960 

ggcgccgacg gggaccccag cattgacggc ggccaaggtg gtgccggcgg ccacggcggc 4 020 

caaggcggca aaggcggcct gaacagcacc gggctagcca gcgccgccag cggtgacggc 4080 

45 ggcaacggcg gggccggcgg ggccggcggc aacggcggcg acggcgacgg ctttatcggc 4140 

gggtccggcg gcaccggcgg gaccggcggc gacgccggcg tcggcggcct ggccaacacc 42 00 

ggcggaaccg cgggcaacgc cggtatcggc ggggccggcg gccgcggcgg cgacggcggg 42 60 

gccggcgaca gcggcgccct ctcccaagac ggcaacggct tcgccggcgg ccaaggcggc 43 20 

caaggcgggg tcggcggcaa cgccggcgcc ggcggcatca acggggccgg cggcaccggc 43 80 

50 ggcaccggcg gggccggtgg tgacggccag aacggaacga caggcgtggc gagcgagggc 4440 

ggcgccggcg gccaaggcgg tgacggcggc caaggcggca tcggcggggc cggcggcaac 4500 

gccggattcg gcgccggtgt tcccggcgac ggcgggatcg gcggcaccgg cggggccggg 4560 

ggcgccggcg gcgccggcgc cgacggggac cccagcattg acggcggcca aggtggtgcc 462 0 

ggcggccacg gcggccaagg cggcaaaggc ggcctgaaca gcaccgggct agccagcgcc 4680 

55 gccagcggtg acggcggcaa cggcggggcc ggcggggccg gcggcaacgg cggcgacggc 4740 

gacggcttta tcggcgggtc cggcggcacc ggcgggaccg gcggcgacgc cggcgtcggc 4800 

ggcctggcca acaccggcgg aaccgcgggc aacgccggta tcggcggggc cggcggccgc 4860 

ggcggcgacg gcggggccgg cgacagcggc gccctctccc aagacggcaa cggcttcgcc 4920 

ggcggccaag gcggccaagg cggggtcggc ggcaacgccg gcgccggcgg catcaacggg 4980 

60 gccggcggca ccggcggcac cggcggggcc ggtggtgacg gccagaacgg aacgacaggc 5040 

gtggcgagcg agggcggcgc cggcggccaa ggcggtgacg gcggccaagg cggcatcggc 510 0 

ggggccggcg gcaacgccgg attcggcgcc ggtgttcccg gcgacggcgg gatcggcggc 5160 

accggcgggg ccgggggcgc cggcggcgcc ggcgccgacg gggaccccag cattgacggc 5220 

ggccaaggtg gtgccggcgg ccacggcggc caaggcggca aaggcggcct gaacagcacc 528 0 

65 gggctagcca gcgccgccag cggtgacggc ggcaacggcg gggccggcgg ggccggcggc 5340 

aacggcggag ccggcgggct cggcgggggc ggtggcacag gcggcaccaa cggcaacggc 5400 

ggcctcggcg gaggcggcgg caacggcgga gccggcggtg ccgggggaac gcccaccggc 5460 
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agtggcaccg aggggaccgg cggcgacggt ggagatgccg gcgccggcgg caacggcggc 5520 
tctgccaccg gcgtcggtaa cggcggtaac ggcggtgatg gcggcaacgg cggcgacggc 558 0 
ggcaacggcg cacccggcgg cttcggtggc ggcgctggcg ccggcggctt gggcggctcc 5640 
ggcgccggcg gcggcaccga cggcgacgac ggcaacggcg gcagccccgg caccgacggc 5700 
agctaa 5706 
<212> Type : DNA 
<211> Length : 5706 

SequenceName : SEQ ID 519 

SequenceDescription :

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 

<400> PreSequenceString :

15 atgtcgttgg tgatcgtggc cccggagacg gtggcggccg cagccttaga tgtggcgcgc 60 

atcgggtcat cgatcggcgc ggccaatgcg gcggcggcgg ggtcgaccac cagcgtgctg 12 0 

gccgcgggcg ccgatgaggt gtcggcggcg atcgcgacgc tgtttggcag ccatgctcgg 180 

gagtatcagg cgatcagcac gcaggtggcg gcgtttcatg accgatttgc gcagacgtta 240 

agcgccgcgg tcggctcgta tgtcagcgcg gaggcgacca acgccgcacc gttggcgacg 3 00 

20 ctggagcaca acgtgctcaa tgccctcaat gcgcccaccc aggcgttgcfc gggtcgcccg 3 60 

ttgatcggtg atggggcggc tggagcaccc ggcaccgggc aggccggcgg ggccggcggg 420 

atcttgtggg gcaacggtgg ggccggcggc tcgggcgcgc ccggccaagt cggcggggcc 48 0 

ggcggggccg ccgggttgtt cggcaccggc ggggccggtg gggccggtgg ggccggcgcc 540 

gccggtgggg ccgggggtag cggcggctgg ctgctgggca atggtggagt cggcggggcc 600 

25 ggcgggcaga gcctgctggg cggggcaacc ggcggggccg gcggcaacgc cggactgttc 660 

ggggtcggcg gaaccggcgg gcccggcggg cccggcgggc ccggcggggt cggcggtacc 72 0 

ggtggtgccg gtggcctggg cgggaccctc tacggggccg gcgggcacgg aggtgccggc 780 

gggcccggcc cgatcggtgg tgtgggtgga cacggcggtg tcgggggtgc ggccgggctg 840 

ttgggcgtgg gcgggcacgg tggtgccggc ggacacggcg cggagggtgt ggccggcgca 900 

30 gccggtgagg acttgtcccc gcacggtacg tccggtgggg tcggcggcga cgccggcgat 960 

ggcggcaccg gagggcgggg cggctggctg gccggcgccg gtggggccgg cggggccggt 1020 

ggggttggcg ggaccggcgg ggccggcggg gccggatttt ctcgtgcctt gattgtcgct 1080 

ggggataacg gcggtgaccc cggcgccggc ggggccggag gcaccggcgg agccggctcc 1140 

acgatcggtg cccacggcgc ggccggggcc agccccacca gcggcggcaa cggcggggcc 12 00 

35 ggcggcaacg gcgcccactt ctcatcgggc ggcaaagccg gcggtaacgg cggggccggc 1260 

ggggccggcg ggctggtcgg caatggcggc gccggaggtg ccggcggcaa cggcgccccc 1320 

ggtgccccgc cttctggcgg cgaccctaac ggcggtggcg gcggagctgg cggcgctggc 13 80 

ggtaagggcg gcgacggcgg agcccaagca ggcgacggcg gtgctggcgg cgccggcggt 1440 

aagggcggca acggcggcaa cggcgccacc ggcgccaccg gccttaacgg cctgggggca 1500 

40 ggcgcggatg gcaccgacgg cggcaagggc ggcaacggtg gagccggcgg tggcggtgga 1560 

gccggtgggc aaggcggtaa ggcgcttgca gccacccacc aggacggcag catgggtgcc 1620 

ggcggggccg gcggaaacgg cggcgccggc ggcatgggcg gtgatggcgg caacggtgcc 1680 

cLa.gggca.cgt tcgataacgg cggcgatgga gttggtggca acggcggcaa cggcggtagc 1740 

cgcggcatcg gtggtgctgg tgggatcggc ggcgccggat ccaccgcggg tgcagacggt 18 00 

45 gcccgcggcg ccaccccgac cagcggcggc aacggcggca ccggcggcaa cggcgcgaac 18 60 

gccaccgtcg ccggcggggc cggcggggcc ggcggcaagg gcggcaacgg cgggcttgtt 1920 

ggtaatggcg gggccggcgg caaaggcggg gacggcatgg ccggtgtcgc cggttcctcg 1980 

cccaccaccg cgggcgaatc cggcacgagc ggccagaacg gcggggctgg cggggcgggc 2 040 

ggggccggcg gccggggcgg agacttcggg ggcgacggcg ggaccggtgg ggccggcggc 2100 

50 aacggcgcca acggcgccaa cgccaccacg ccgggcgcca agggcggcga cggcggccac 2160 

ggcgggcccg gcgcgcaagg cggtaacggc ggccaaggcg ggcccggcgg tttggcgggg 2220 

aacctctttg gccagaacgg aatccagggt gtcggcgggt ccggcggcaa agggggggcc 22 80 

ggcgggctcg ccggtgacgg cggcaacggc gccaacggca acttcgcctt cggcgatggt 2340 

aacggcggtc acggcggtaa cggcggtaac cccggcgccg gcgggcaggg cggtagcggt 2400 

55 ggcgccggct ctaccccagg cgccaagggc gcccacggct tcactccaac cagcggcggc 2460 

gacggcggcg acggcggcaa cggcggcaac tcccaagtgg tcggcggcaa cggcggcgac 2520 

ggcggcaatg gcggcaacgg cggcagcgcc ggcacgggcg gcaacggcgg ccgcggcggc 2 580 

gacggcgcgt ttggtggcat gagtgccaac gccaccaacc ctggtgaaaa cgggccaaac 2640 

ggtaaccccg gcggcaacgg tggcgccggc ggggccggcg gggccggcct gaacggcggt 2700 

60 aacggtggcg ccggcggcaa cggcggcctc ggcggattcg gcggcaacgg cgccgccggt 2760 

gccaacggcg tggccgtggg tgcgcccgga caacccggcg gtgccggcgg gcacggcggc 2 82 0 

gccggcggca acggcggggc cggcggcaac ggcggtcaag gcgtagtcag cgacggcgcg 2880 

ggcggtgccg gtggggccgg cggcgacggc ggtgctcccg gtgacggggc caacggcggc 2940 

aacggccagg gagcaggggc cttcgccggc ggcggcggcg ggcgaggcgg cgacggcggc 3 000 

65 aacgccggca acgccggtgc cggcggcccc ggcggcaccg gctccaccgc aggcaaggcg 3 060 

gggccggccg gcagcatcct gcacgacggc ggcaacggcg gtcacggcgg ccacggcgct 3120 

gccagcggcg gaaacggcgg ccccggcggc cacgggggta acggcggtaa cggcggcacc 3180 

ggcgccaacg gcggcaacgg tggcatcggc gggactggcg gcgccggcag caccggcgcc 3240

aagggcgtcc tcggcaccaa cgagggcgat ggcggtgacg gcggcagagg cggcaacggc 3300

ggcagaggcg gcaacggcgg ccaaggcctc accggagccg gcggcaacgg tgggaccggc 3360

gggacacccg gcaacggcgg caacggtggc aacggcgcca gtggcgacct tgtcacctca 3420

cctggtgacg gcggcggcgg gggccgcgga ggcgatgcag gacgcggagg cgatgcagga 3480

cttggcggtt ccagcggccc aggtggcacc cccggcgact ggggcaccgg cggcaccggc 3540

ggcaccggcg gcaccggcgg ccagggtgcc aacggcggcc tcaccggcgg cagaggcggc 3600

actggcggca acggcggcaa cggtaacacc ggcggcaccg gcggcgctgg cggcaccggc 3660

ggcaccggcc acaacggctc tcagcctggg atgggtggca acggaggtgc gggcggcttt 3720

ggcggcaatg gttttgccgg cgtcggtggc cggggcggga tgggtggctc cgggggcacc 3780

ggcggcaccg gcgacgcggg gccctttgga acaggcaccg gtgggaccgg tggccacggg 3840

gggcagggcg gtggcggcgg cttcagcatc ctccttggtc tcggcggtct cggcggcctc 3900

ggtagcccgg ggtccatcgc cacaggtacc gccggcggcg ccggtggcgg cggtggcttc 3960

ggcgggctcg gcggcggcga attcgtgtag 3990

<212> Type : DNA
<211> Length : 3990

SequenceName : SEQ ID 520
SequenceDescription :

Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgtcatatg tgatcgcgac gccggagatg atggcaactg cggcttttga tctggcgcgt 60 

25 attggttcgc aggtgagcgc ggctagtgcg gtcgcggcga tgccgacgac ggaagtggtg 12 0 

gccgccggcg ccgacgaggt gtcggcgggc atcgcggcgt tgttcagcgc gcacgctcag 180 

gagtatcagg cgctaagtgc gcaggccgcg gcgtttcacg accagttcgt gcacacgctg 240 

accgcggccg cgaggtggta cacggccacc gagatcgcca acgccgcggc gatgcgagtc 30 0 

gtgctcggag ccgtgaatgc gcccacccag acgctactgg gacgcccgct gatcggcgat 3 60 

30 ggtgcgcacg ggacagcgcc tgggcagccc ggtggggccg gtggattgtt gttcggcaat 420 

ggcggcaacg gcgctgccgg tgccgtcggg caggtgggcg gcgccggcgg ggcggccggg 48 0 

ttatttggga tcggcggcgc cggtggcgcg ggcggggcgg gcgcacccgg aggtaccggc 540 

gggacgggtg gatggctggc gggtgggggc ggcgtcggcg gtatgggtgg ggctggtggt 600 

ggcgccggcg gggcgggtgg caacgcgggc ctgttcggca acggcggcgc cggtggtgcc 660 

35 ggtggggctg gtggtggcgc cggcggcgcg ggcggtaacg cggggtggtt tggtcatggg 72 0 

ggcgctggcg gcgtgggtgg tgtaggtgcg gccggggcca acggtgctac gcccggtcag 78 0 

gatggggcgg ctggtgttgc cgggtcggac gacggtgccg gcggtgacgg tttggcaggg 840 

tcggacgggg gcgatggcgg tgccggtggg gtgggcggca acggtggtcg ^ggagggtgg 90 0 

cttctcggta acggcggcgc cggtggcgtt ggcggtgtcg gcggggccgg tggtgccggt 960 

40 gcggccggcg gtgccggcgg tgccggggct accggaataa atgggccggc cggtatctcg 1020 

gcggccggcg gtgacggcgg cgccggcggc aatggcggtg ccggcggaaa cggcggcgtg 1080 

ggcggcgccg ggggtgctgg cgggtcggct ggactattgg ggtatgtcgg ccgggccgga 1140 

gacggcgggg ctggcggggg tggggggctg ggtggagcgc ctggtgacgg cggtgccggc 1200 

ggcaacggtg gcagttggct ggccgccggt gacggcgggg ccggcggtca cggtggcgac 12 60 

45 ccgggcctgg gtggggctgg cggagccggg ggggcgtcgg gcggcgccgg tgctcgcgcg 1320 

ggggccaatg gtctggcggc cggcaacgac gggccggtca gcggcggcaa cggcggcaaa 13 80 

ggtggcaatg gcgcccacgc accggtcgcc ggcggtcatg gcggtaacgg cggtgccggt 1440 

ggcaacggcg ggttggtcgg tgacggtggg gccggcggtc atggcggtga cggagccgcg 1500 

ggtgccggct atgccgatat gacggcgatc ttcctgggtt catccggtac ccccggtgag 1560 

50 gatggaggta acggcggggc cggtggggcc ggcggcgcgg gtggggccca cgccggcgat 1620 

ggcggggccg gcggtgccgg cggcaacggt ggggccggcg gggccggtgg taacggagct 168 0 

cacggtttca atgctgtgct cgtatctgac ggcggcaacg gtggtgatgg cggagccggc 1740 

ggtcgcggcg gtgacggtgg ggccggtggg gctggcggtg acgcacctgc gggtcgggcg 1800 

ggcagccagg gtgttggcgg ggacggtggt gccggcgggg ctggtggggc gcccggtaac 1860 

55 gggggcagcg gtggccgcgg cgacatggct ttcaaggatg gtgacggtgg ggccggcggc 1920 

gatggtggtg acccaggcgc cggtgggaag ggtggcgccg gcggcgccgg cgccaccgag 198 0 

ggtgtgaccg gcgcgaccgg cgctaccgtg cacagtggtg gcaacggcgg caagggcggt 2040 

aacggcgccg acgccaccgt ggctggcgcg aacggcggca agggcggtgc cggtggtaac 210 0 

ggcgggttgg tcggtgacgg cggggccggc ggcgacggtg gtagcggcgc ggccggtgcg 2160 

60 aatggcgcga acgtaggtga ggacggcgcc gacggcaccc tctcggggca accgggcgaa 2220 

ggaagcgagg ccaacggcgg tcaaggcggc gttggcggtg gcggcgcggg cggtgccggc 2280 

ggtgacggtg gtgccgggag ttcggcgtta ggtagtggcg gcaacggcgg tcgaggcgat 2340 

gcggggcagg ccggcggcgc gggcggtgcc ggcggagccg gtggcgccgg tgggtcggtg 2400 

tcgggcgacg gtggacccgg tggcaagggc ggggccggcg gtgcgggtgg tgccggagct 2460 

65 agtggtggcg gcggcggcaa gggcgcctcc ggcgccgaca gcgctgaggc cgttggaggc 2520 

gccggcggga aaggcggcga cggcggtgtc ggcggcgttg gcggtgacgg cgggcccggc 2580

ggtgatggcg gcgccggcgg ggccgcaccc gcgggtcagg tcggcagcca cggtgtcggc 2640 
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ggcgttggcg gtgacggcgg gctcggcgga gccggcggca acggcggtga cggcggtcat 2700 

ggcagtgatg gcggggacgg cggtgacggc ggtgaccccg gcgcgggagg cctcggcggc 2760 

ttgggcggcg acagcggcaa cggcacccgc gcggccagcg gtgtggacgc cagcgaccac 2820 

gggcccggca gcggcggcaa cggtggtaac ggtggcaacg gtgcgcaagc cagcgtcgcg 2880

5 ggcggcgccg ggggcaacgg aggcgacggc ggcaatgccg gccgggtcgg cgatggtggc 2940 

gccggcggca atggtggcga tggcgcggcc ggcgccaacg gggccaattc gggcgcgcct 3 0 00 

ggttcagatg ccctcgccct cggccaaccg ggaggcaacg ggggtcaggg cgacgcgggg 3 060 

caggccggcg gtgccggtgg tgccggtggt gccggtggcg ccggtgggtc ggtgtcgggt 3120 

gacggcggtg ccggcggcaa cggcggggcc ggcggcaacg gcggtgtggg cgctagcggt 3180 

10 ggggccgggg ccaggggcgc caacggtatc gacagtatcg gcggcaccgg tggggccggc 3240 

ggtggtggcg gcgatggcgg tgccggcggg gtcggtggac atggcgggga cggtggggtc 33 00 

ggcggggccg caccttcggg gacggtcggc agtcacggca ccggtggggt cggtggcgac 3 3 60 

ggcggactcg gcggcgccgg cggcgttggc ggggccggcg gcaacggcgg cattggtatc 3420 

accgtcgqcg ycgccggcgg 3-g n Z3 c !-'Z ! 3*~ -^ggcggtg :c ""7Z n Z- ciggcggtcgc 3480 

15 ggcggtctgg ggggtgacag cggcaacggc acctccgcgg ccaacggcgt ggacgccagc 3540 

aaacacgggc cactgaccgg cggcgacggc ggcgtcggcg gcaacggtgc caaggccgcc 3 60 0 

gcggccggcg gcgacggcgg ccagggcggt gacggcggca acgccgggct attcggggac 3 660 

ggcggagccg gtggtgatgg ggccgacggc accgctgccg aagctctcgg cggtgatggc 3 72 0 

ggggccggtg gggctggagg caagggcggt gacgccggcg acatcggtga cggcggtgac 378 0 

20 ggcggcaagg gtggcgacgg cgcgcacggt gccctcggag ggctcaccgt tgctggcggc 3 84 0 

aacggtgggg ctggtggcgc cggtggggct ggtggcgccg gcggagcatt tctgggcgac 390 0 

ggcgggaacg gcggagccgg cggccaaggt ggggctggcc ggggcggcag ccccggcggc 3 960 

ggcggcgggg ttggtggaca cggcggggcg ggcggcgacg ccgggatgaa cggcggcggc 402 0 

ggaaccggcg gccagggagg caacggcgcg gccggtggcg cgggttggtc gcccgactcc 408 0 

25 gacctaaagg gcttcgacgg cttcgacggc ggcagcggtg gggcgggagg cgacggcggg 4140 

gccggtggcg ccggcggaac tcagaccggc gacggtggtg acggcggggc cggaggccta 42 00 

ggcggggctg gtggggtcgg cggtaacggc gttgacggct ttgacattaa cgaaacgacg 4260 

ggccgcgacg gcggcgacgg cggcgacggt ggctacggcg ggtggggcgg cgccggcgga 43 2 0 

aacggcgggg ccggcgggtc ggcacccgcc ggcgaggtcg gcaatcgagg cgttggcggg 43 80 

30 gacggcggtg acggcgggtc gggcggggac gctggtaatg gtggcttggg cggtgacggc 4440 

tttacctatc tcgcggattt tgacggagag cctggcggcg acggaggtga cggcggcgac 4500 

ggcgggtggg gccgccccgg cggacaaggc ggtttcggtt ccacaagtgg cgcgcacggc 4560 

aaggccggct tcggcgcccc cggcggtgac ggaggcgacg gcgggaacgg cggtcacggt 4620 

ggggacggca acggcagctt tgctgatgca ggcgacggcg ggccgggtgg caacggcggc 4680 

35 aacggcggct tgggtggggc cgggcgagac ggtggcgccc ccggtggtga cggaggcgac 4740 

ggcgggaccg gcggctccgg cggcttcggc gcccctccgc ctcgcagcat cggcggcggt 48 00 

gacggcgggg acggcgggcg cggaggtgac ggcggacgcg gtgccggtgg cttgacctcc 4860 

ggcggcgtcg gctcgtcggg cgagtcgggt ggctccggca acgggagggg cgaccctggc 4920 

tccggcggtt ccggcggtga aggcggcgaa ggcggcccct ccatctccgt taacgtcacg 498 0 

tag 4983

<212> Type : DNA
<211> Length : 4983

SequenceName : SEQ ID 521
SequenceDescription :

Sequence



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString :

50 atgtcgtttg tgttggtttc gccggagacc gtggcggcgg tggccacgga tctcaagcgc 60 

atcggcgcct cgctggccca cgaaaacgcg tcggcggccg cttcgacgac ggcggtggtc 120 

tccgcggccg ccgacgaggt atcgacggcg gtcgccgctc tgttctccca acacgcccag 180 

ggctaccaag cggcggccgc tcaggtagca gcgtttcata gccggtttgt gcaagccctg 240 

acggccggtg ccggggcgta cgcatttgcc gaggcggcca acgcgtcgcc gctacagtca 3 00 

55 gccatgggtg cggtaagcgc gtctgcgcag acgctgttgt cgcgcccgtt gatcggcaat 3 60 

ggcgccaatg cgacgacgcc gggcggtaac ggcggcgacg gcggatggct attcggcagc 42 0 

ggcggcaacg gcgcgcccgg cgcggcgggc cagtccggcg gtaacggcgg gtcagccgga 480 

ctgtggggta acggcggcgc gggtggcgcc ggcggcagcg gcggcgccgc cggcggcaac 540 

ggcggtaacg gcgggtggct gttcggcgcc ggcggcaccg gcggtatcgg cggcaccggt 600 

60 gctcccggcg ccatgggcgg caccggcggc aacggcggca acggcgcgct gctgatcggc 660 

ggcggcggcc tcggcggcgc cggcggcatg ggtggcaccg gcggcggcac cggcggcacc 72 0 

ggcggcaacg gcggcaacgg cgcgctgctg atcggcgctg gtggtgtcgg aggtgctggc 780 

gggatcggtg gccagggtac cggcgccggc ggtgccgccg gcgccggcgg caccgggggc 84 0 

aacggcggcg ccggggggtt gttcatgaac ggcggcgacg gcggcgccgg cggtcaaggc 90 0 

65 ggcgacggtg cggccggcga cgcggctgcc agcgccggcg gcaccggcgg caaaggcggc 960 

caaggcggcg acggcggcac cggaggggcc ggcggcgcag gcccagtgct gttcggccac 102 0 

ggcggcgccg gcggcatggg cggccaaggc ggcaccggtg gaatgggcgg cgccggcgga 108 0 
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gacggcacca ccgtcatcgc ggccggtacc gggggggagg gcggcaccgg cggcgcggcc 1140 

ggcgccggcg gagccgcagg cgctcgcggg gctctcacca gcggcggcct agccggcggc 120 0 

gtcggggccg gcggcaccgg cggcaccggc ggtaccggcg gcaacggcgc tgacgccgct 1260

gctgtggtgg gcttcggcgc gaacggcgac cctggcttcg ctggcggcaa aggcggtaac 132 0 

5 ggcggaatag gtggggccgc ggtgacaggc ggggtcgccg gcgacggcgg caccggcggc 13 8 0 

aaaggtggca ccggcggtgc cggcggcgcc ggcaacgacg ccggcagcac cggcaatccc 1440 

ggcggtaagg gcggcgacgg cgggatcggc ggtgccggcg gggccggcgg cgcggccggc 150 0 

accggcaacg gcggccatgc cggcaacaca ggtgacggcg gcgacggcgg gaccggcggt 1560 

aacggcggca acggcaccgg aggcgtgaac ggcgccgaca acaccctcaa ccccgacacc 162 0 

10 cccggcggcg ccggggagcc cggcggggcc ggcggggccg gcggggccgg cggggccgcc 1680 

ggcggcccgg gcggtaccgg cggtaccggc ggtaacggcg gcaacggcgg caacggcggc 1740 

aacggcggca acggcggcaa cggcggcaac ggcggcaatg ccggcaacaa cagcaccaat 18 00 

gccccagtcg gtggcgaagg cggcgccggc ggcgacggcg gcgccggcgg cgcaggcggg 18 60 

gccgccaacg gcggcaccgc gggcagccag ggcactgggg gcgtaggcgg cgacggcggc 1920 

15 gcgggcggca acggcggcgg cggcaaggct ggcaccggca acagcggcaa ctttggggtg 1980 

gacggcgaag ccggcttcag cggcggcgcc ggtggcaacg gcggcgtagg cggggccgcc 2040 

ggcgccaatg gcggaaccgg cggcagcggt ggtaatggcg gtgacggcgg tgcgggaggc 210 0 

attggcgggg ccggcggcaa cggcataccg ggcactggca cagagcctgc cgggggcacc 2160 

ggcgccaaag gtggagacgg cggcgacggt ggcgccggcg gcgcaggcgg caatgccggc 2220 

20 ggggccggcg gccagggcgg caatgccggc cagggtggcg ccggcggtgc gggcggcaac 22 80 

gccgtgattc ccggcgacgg cgtcgggaag gcgccgcacg gcgacgcggg cggcagcggc 2340 

ggagacggcg gcaaaggcgg ccagggcggt agtggcggca ccggcggatc cggtgccccg 24 00 

atcggtggcg gcgccggagg caccggaggg tccggcggac acgccggcaa gggtggcgcc 24 60 

ggcggcatcg gcgcacaggg caccaccatc accgtgcccg ggaacggcgg caacgccggc 252 0 

25 gacggcggca acggcggcaa cgccggcgcc ggtggaaacg gcggctccgg cgacttcggt 2580 

ggcaatacca ccagcggcgc ctccggcagc ggcggcaacg gcggcaacgc cggcaccgcg 2640 

ggtagcggcg gtgcgggcgg aaccggcggc accggcctta gcggcggcaa cggtggcaac 2700 

ggcggcaacg gcggcaacgg cggtgacggc ggtaacggcg cccacggcac cgtcggcgcc 2760 

cagttcgtcc cggccaccag cttgcccaca cccaacggcg gggccggtgg caacggtggc 2820 

30 accggaagca acggcggcgc gcccggcccc gccggggcgc ccggccccac taccggcggt 288 0 

aacgctggca gccagggcat cggcggcgac gggggcaacg gcggcgacgg cggtaaaggc 2940 

ggtgacggcg ccgacgctgt caacgtcgta ttcatgccga ctgagccaca ggccgcgacc 3 000 

ggcactgccg gcagcgccgg tgaccccacc ggcggtaacg gagggcccgg cactcccggc 3060 

agccccatgg ttgccccgcc cccgccaacg ccaatcactc aagtccaaca gggcggtgac 3120 

35 ggtggcgccg ggggcaccgg atccaccaac gccaacgacg gcacagccac cggcggaaag 3180 

ggcggagaag gcggagtcgg cagcattctc ggcgggcccg gcggcaacgg cggaactggc 3240 

ggcaacgcct cggcaaccgg caccaacggg gtggccaacg ccgggaatgg cggcaagggt 33 00 

ggcgacggcg gccagtttgg ggccggcggc aacggtggtg ccggcggcag" cgtaaccgac 33 60 

ggatccgccg gcagcaccgc aggcaacggc ggcaacggcg gcaacgcaac caacggcacc 3420 

40 atcgcaggcc aacccgccgg cggcaacggc tcggccggcg ggaaaggcgg cgacggcggc 3480 

aacatcgccg ccggtgccac cggcaccgcc ggcaacggcg ggaacggcgg caacggcaac 3 540 

gacggcgccg tcaacgccgg caccggcggc tccggcggga acggcggtaa cgccggtggc 3600 

ggcggcgcca atggcggcga cggcggcgcc ggcggcgccg gcggggccgg cgggcgtggc 3660 

ggcaagggca tcgacggcgg gttcggcggt gacggcggca acggcggcag caacaacggc 3720 

45 accggcgccg gtggcaacgg cggcaacggc ggcaccggcg gggtcggctc ggttggcgcg 3780 

gctggtggcg atggcggcaa cggcggcacc ggcggcttcg ccggtttcgg cggcaccgca 3840 

ggcaatggcg gttccggcgg cacgggcggg gccggcggcg acggcggcac cggcggggac 3900 

ggcggcaacg gcgttatcgc cggcggcggg gggaccggcg gcaacggcgg cgccagcggg 3960 

gccggcggcg ccggcggcac gggcgggttc gccggcaacg gcaatgccgg cggcaatggc 4020 

ggcaccggcg gcgcgagcga ggacggcgac aacggcaacg ctggcagcgg cgccaccggc 4080

ggtaccggcg gcaacggcgg caccggcggc gacggcggcg ctgccgggct gggcggcgtc 4140 

gcgtga 4146 
<212> Type : DNA 
<211> Length : 4146 

SequenceName : SEQ ID 522

SequenceDescription :

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

atgtcgttcg taatcgcgac gccggagatg ctgaccacgg cggcaacgga tttggcgaaa 60 

attggttcga cgatcactgc ggccaacacc gcagcggccg cggtggcgaa agtgctgccc 120 

gcgtcagccg acgaggtgtc ggtggccgtt gcggcgttgt tcggcacgca cgcccaggaa 180 

tatcagaccg tcagcgccca agtggcgacg tttcatgacc ggttcgtgca gaccctgtcc 240

gcggccgcga gctcgtacgt ggccgccgag gcggtcaacg ttgaacagag tttgctagcc 300

gcggtcaatg cgcccaccca ggcgctgttc ggacgcccgc tgatcggcaa cggcgccgac 360
ggctcccccg ggaccgggca ggccggcggg
aacggcgggt ctggggcgcc aggacaacga 
ggtaacggcg gcaacggtgg agccggcggc 
ggcggcgcgg gcgggtggct gtatggcaac 
5 gcggtcggcg gcaacggcgg ggccggcggt 
ggtggggccg gaggcaacgg catcgccggt 
ggatccggca ccgctggcgg ggccggcggg 
ggcggggtgc tgatgggcaa cggcggcaac 
ggcgccggcg gtgcgggcgc cagcggcgcc 

10 gccggcggaa acggcgggaa tggtggggct 
ggcggccacg gtctgctcgg cctcggcggc 
ggcggtgacg gcggagctcc cggtgacggc 
aaccttggtg ccggcgggac cggtggcaat 
ggcgccggtg gcgccagcat cggcggcagc

15 acctccacca gcggtggtaa cggcggcgac 
ggacagaccg gcgcgaacgg tggcaggggt 
gccggtgggg ccggtggccg cggcggggcc 
ggtcgccccg gcggggccgg cggcaccggc 
ggtgacggcg gcaccggagg tgctggcggg 

20 ggcagcatcg gcttcaatgc gtccgctccc 
ggtaacggtg ggcctggcgg agccggcggc 
gcctcgggcc agaacggcag ccagggcgct 
gggactcctg gcaacggcgg tcacggcgcc 
ggtggcgccg gtggccacgg cggtgatccc 

25 agcggctcca cccccggtgc caacggcgca 
ggcggcaacg gcggcagagg cgccgatgcc 
ggcaggggcg gtgatggcgg tttggtcggc 
ggcagcaaag gcctgcccgg tctgggcagg 
ggcggaaacg gtggggccgg tggatccggc 

30 ggggccggcg gcaccggcgg cgtcggcggg 
ggttccagcg cgggcgcgga cgggcacccc 
ggaaaaggtg gggacggcgg ggacggcggg 
ggccccggcg gtgccggcgg cgacggtggg 
ggcattgacg gcgccgacgg cgccaccgcg 

ggggccggcg gaaagggcgg ccgaggcggg
ggcacaaccg gcagccaagg cgccggcggc 
cccggtgacg gcggcaacgg cgctaacggc 
aacggcggca acggcggcaa cgccggcccc 
tccactfctcg gtgcgaccgg ctcaagcagc 

40 aacggcggca acggcgacca tgccctcagc 
aacggcggca acggcagcct gcgcggcagc 
ggcaatgcca gcaggggcat gggcggtgac 
gggcagattg gcaacggcgg agcgggcggc 
ggcaatcccg gagccattac cggcagcggc 

45 caaggtggga gcgtcgccgg cgacggcgct 
ggcgggacgg gcctgcgggg caccaccggc 
ggcgcggacg gtcacggcgg taacggtggc 
ggcggcggcg gcggcaacgg cggggccggc 
ggcagccaag gcgccggcgg agatggtggg 

50 ggcggcgacg gcggcagagg tgcccacggc 
gggactggcg gaaatggcgg caccggcggc 
ggcgggaccg gttccaccct gggcgcgacc 
aacggtggcg tcggcggcag tggcggcctt 
ggcatgggcg gcgccggcgg caccagcacc 

55 ggcttcggcg gcgacggcct tgatgccagt 
ggtggcgacg gcttcaggac cgctggggcg 
gccgaccccg gcggtttatt cccgatccct 
accggcggca ccgcacacct cgggcccctg 
cagttcggca gccccggcgc cgacggccgc 

60 ggcgccggcg gcagcttcta g 
<212> Type : DNA 
<211> Length : 3921 

SequenceName : SEQ ID 523 
SequenceDescription : 


Sequence 



ccaggcggca tctfcgtacgg caacggcggc 420

ggcggggccg gcggggcggc gggcctaatc 480

gtgggtacca ccggcggggc cggtggtcac 540

ggaggcgccg ggggttttgg cggggccggg 600

accgccgggt tgttcggtgt cggcggggcc 660

gtcacgggta cgtcggccag cacaccgggt 720

atcggcggca acggcggggc cggcggggcc 780

ggcggggccg gcggcgaggg cgggcccggc 840

cacgccacca acttgggcgc tgacggtcaa 900

ggcgggaccg gcggggtcgg cgggcccggc 960

agccacggcg ccggcggggc tggcggtagc 1020

ggcaacggcg ctaccgggac gtggggtcac 1080

ggcggcaacc cgggcgccgg tggggccggt 1140

gcgcatggcg cgaacggogc gcccggcacc 1200

ggcggcaaag gcgccgacgc tatcagcagc 1260

ggcgacggcg gtcaggtggg taatggcggt 1320

ggcggtctcg gatttggatc cgaagcgccc 1380

ggggccggcg gcaacggcgg aactcaggcc 1440

gccggcggcg atggcggatc cggaggtgct 1500

ggggccgcag gctcacccgg cgggaatggc 1560

g a gggcgggg ccggcggtct cgcgttggcc 1620

ggcggtgacg gcggagccgg cgggaacggc 1680

gccggcgcgc tcggtgtcaa cggtggtgta 1740

ggtgtcggcg gtgccggcgg ccagggcgga 1800

cccggcaaca cccccaccag cggcggcaac 1860

accggctttg gccagaccgg cgcgtccggc 1920

aacggcggcg ccggcggcgc cggcggcaac 1980

ctcggcaacc ccggcctgga tggcggcacc 2040

ggcgcctggg cgggcaacgg tggcaccggc 2100

actggagggt ccggcagcga cggtgtcaac 2160

ggcggcaccg gcggagttgg cggtaccggc 2220

gccgcaccca acggcgtcgc cggcagccaa 2280

accggcggag ttggcggtaa cggcggccgc 2340

ggtgcccgcg gccaggatgg cggcgccggc 2400

accggcgggc cgggcggggc cgggccggcg 2460

aacggcggca gcggcggcac cggcggggac 2520

agcgtgttca ccaacaacgg catcggcggc 2580

agcggggccg gcgggagtgg cggcgccggc 2640

agcattcatg tcaacggcgg caacggcggc 2700

ggcaacggcg cggccggcgg caacggcggc 2760

ggtggggccg gcggccacgg cggcaacggc 2820

ggcggaaccg gtggcgccgg cggaaacgcc 2880

aatggcggag acggcggcac cggtagtgac 2940

ggccgcggcg gcgacggcgg cgtaggcggg 3000

gacggcggcc ggggcggggc cggcggcacg 3060

gccaccggtg cgaccggcac attcgacgct 3120

accggcgggg ttggcgggac cggcggcgcc 3180

ggcaaagcgc tgtcgccaac gggcaacaac 3240

gccggtggcg ccggcggcac cggcggcacc 3300

actctcttca gcagcctcgc tggcactggc 3360

accggcggca ccggtggtgc cggcggtgcc 3420

ggtgcgacag gggcggccgg ccgcgccgga 3480

ggctccgcct ttggccccgg cggcaccggc 3540

gtcagcgccg gcggtgacgg cggaaggggc 3600

tccggtggga atggcgggga cggcgggcac 3660

ggcgggcgag gtggggacgg cggcaaaggg 3720

ggggctggcg gtaagggtgg caccggtggc 3780

gccatcatcg gccaatccgg ccagcccggc 3840

ggcggggccg ggggcgcagg cgggggcggc 3900
<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgtcggcgg ccgcggtcgc ctgggaccaa ctggcgatgg aattggcctc ggcagcggcc 60 

tctttcaact ccgtgacgtc gggcctggtc ggcgaatcgt ggctcggacc gtcatcggcg 12 0 

5 gcgatggccg ctgcggtagc gccgtaccta ggatggcttg ccgcggcagc ggcccaggcc 18 0 

cagcggtcgg caacccaagc cgcggccctc gtggccgagt tcgaggctgt ccgggcggcg 240 

atggtgcaac cggcgctggt ggcggccaac cgctccgacc tggtgtcatt ggtgttctca 3 00 

aacttcttcg ggcagaatgc tccggcgatc gctgcgattg aggccgcata cgaacagatg 360 

tgggccatcg atgtgtcggt gatgtcggcc taccatgccg gggcatcggc ggtggcgtcg 420 

10 gccctgacgc cgttcactgc gccgccgcag aatctgacag acctgcccgc ccagttggcg 48 0 

gccgctccgg cggccgtcgt caccgcggcg atcaccagtt ccaagggtgt gctggcaaat 540 

cttagcttag gcctggcaaa ctcgggcttc ggacagatgg gcgccgctaa ccttggcatt 600 

ttgaacttgg gcagcctaaa tcccggcggc aacaacttcg gccttggaaa tgtcggcagc 660 

aacaacgttg gcttgggcaa traccggcaac ggaaacatcg gcttcggccia cacgggcaac 720

ggaaacatcg gcttcggcct caccggcgac aaccagcagg ggttcggcgg ctggaactcg 780

gggaccggca atatcggctt gttcaactca ggcaccggca acatcggcat cggcaatacg 840 

ggcaccggaa acttcggcat cgggaactca ggtaccagct acaacacggg tatcggcaac 90 0 

acgggccaag ccaacacggg cttcttcaac gccggcatcg ccaacactgg catcggcaac 960 

acgggcaact acaacacggg cagcttcaat ctaggcagct tcaacacggg cgacttcaac 102 0 

20 acgggcagct ccaacacagg cttcttcaac cccggcaacc tcaacaccgg cgtgggaaac 108 0 

accggcaacg tcaacaccgg tggattcaac tctggcaact acagcaacgg cttcttctgg 1140 

cgaggcgact accagggctt gatcggcttc tccggcacac tgaccattcc cgctgctggc 12 0 0 

ctagacctca acggcctcgg ctccgtcggc cccatcacca tcccgtccat caccattccc 12 60 

gaaatcggcc tgggcattaa cagttccgga gcgttggtgg ggccgatcaa tgttccgcct 1320 

25 attactgttc ccgccatcgg gctgggcatc aacagtaccg gggcactcgt tggccccatc 13 8 0 

aacatcccgc cgatcactct aaattctatt ggcctagagc tatcggcgtt ccaggtcatt 144 0 

aacgtgggat cgatttcgat ccccgcgtct ccgcttgcga tcggcttatt cggcgtaaat 1500 

cccaccgttg gcagcatagg cccgggtagc atatcgatac agctaggcac tcctgagatt 1560 

cccgcgattc cgccattttt ccccggattc cctccagatt acgtgacagt gagtggtcaa 162 0 

30 atcggtccca tcaccttctt atcgggtggt tatagtttgc cggctattcc gttgggtatt 168 0 

gatgtgggtg gagggttagg cccgtttacg gtgttcccgg atggctactc acttccggca 1740 

atcccgttgg gtattgatgt gggcggaggg ttaggcccgt ttacggtgtt cccggatggc 180 0 

tactcacttc cggcaatccc gttgggtatt gatgtgggcg gagggttagg cccgtttacg 1860 

gtgttcccgg atggctactc acttccggca atcccgttgg gtattgatgt gggcggcgcc 1920 

35 atcggccccc tcactacccc gccaatcacg atcccctcaa tcccgttggg catcgacgtg 1980 

tccggcagcc tcgggccgat caacatcccg atcgaaatcg cgggcacccc aggcttcgga 2040 

aactcgacca ccaccccgtc atcgggcttc ttcaacagcg gtacgggcgg cacatcgggt 2100 

ttcgggaacg tcggttcggg cggatctggc ttctggaaca ttgctgggaa tctcggcaac 2160 

tccggattcc ttaacgtcgg gccactgaca tcgggaatct tgaacttcgg caacacagtc 222 0 

40 tcaggcctct acaacaccag cacgctgggc ctagcgacat cggcctttca ctccggcgtc 2280 

ggtaacaccg acagccaact cgccggcttc atgcgcaacg ccgcaggtgg gacgttattc 2340 

aacttcggct tcgccaacga cggcacactc aacttgggca acgcaaacct cggcgactac 240 0 

aacgtgggta gcggaaacgt cggtagctac aacttcggta gcggaaacat cggcaacgga 246 0 

agtttcggtt tcggaaacat cggaagcaac aacttcggtt ttggaaacgt cggcagcaac 252 0 

45 aaccttgggt ttgcaaacac gggtccgggg ttgacggagg ccctgcacaa cattggcttt 258 0 

gggaacatcg gcggcaacaa ctacggcttc gcgaacatcg gtaacggcaa catcggcttc 2640 

ggcaacacgg gcactggaaa tattggtatc gggctcaccg gcgacaatca ggtcggattc 270 0 

ggggcgctga actccggcag cggcaacatc ggcttcttca actccggcaa cggcaacatc 2760 

ggcttcttca actcaggcaa cggaaacgtc ggcatcggca actccggcaa ctacaacacc 2820

50 ggcctgggta acgtgggcaa cgccaacacg ggcctgttca acaccggcaa cgtcaacact 2880 

ggaatcggca acgcaggaag ctacaacaca ggcagctaca acgccggcga caccaacacg 2940 

ggcgacctca acccgggcaa cgccaacacg gggtacctaa acctcggcga cctcaacacc 3000 

ggctggggaa acattggcga ccttaacacc ggcgccctga tctcgggcag ctacagcaac 3 060 

ggcatactgt ggaggggcga ttaccagggt ctgattggct actcagacac actcagcatt 312 0 

55 cccgccatcc cactgagcgt cgaagtgaat ggtggcatcg gtccgattgt ggtgccggat 318 0 

attactattc ctggtattcc gttgagcctg aacgcgctgg gtggtgtcgg tccgattgtg 3240 

gtgccggata ttactattcc tggtattccg ttgagcctga acgcgctggg tggtgtcggt 33 0 0 

ccgattgtgg tgccggatat tactattcct ggtattccgt tgagcctgaa cgcgctgggt 3360 

ggtgtcggtc cgattgtggt gccggatatt actattcctg gtattccgtt gagcctgaac 342 0 

60 gcgctgggtg gtgtcggtcc gattgtggtg cctgatatta ctattcctgg tattccgttg 3480 

agcctgaacg cgctgggtgg tgtcggtccg atcaccgttc ccggcgtccc tatttcccgc 354 0 

atccccctta cgattaacat caggataccg gtcaacatca ctctcaacga acttccgttt 3 60 0 

aacgtcgctg gtatcttcac gggctacatc ggccccatcc cgcttagcac attcgtatta 3660 

ggcgtcacgc tggccggcgg caccctggag tctggcatcc agggattcag tgttaatccg 3720 

65 ttcggtttga atattccgct gagcggtgct accaacgctg tcacgatccc tggtttcgcg 378 0 

attaatccgt ttgggttgaa tgttccgttg agcgggggca cgagcccggt tacgatccct 3 84 0 

ggtttcgcga ttaatccgtt tgggttgaat gttccgttga gcgggggcac gagcccggtt 3900 
acgatccccg gcttcaccat tcccggatcc cccctgaact tgaccgccaa cggcggcttg
ggaccgatca acatcccgat caacatcacg agcgccccgg gcttcggaaa ctccaccacc
accccgtctt cgggcttctt caacagtggc gatggaagcg catccggctt cggcaacgtc 
gggcccggca tttcgggcct ctggaaccag gtgccgaacg cgctgcaagg cggagtctcg 
5 ggaatctaca acgtcgggca gctggcgtcg ggcgtggcga acctaggcaa caccgtctcg 
ggcttcaaca acacgagcac cgttggtcac ctcaccgctg cgtttaactc gggcgtcaac 
aacatcggcc aaatgctcct gggcttcttc tcaccgggtg ccgggccgta a 

<212> Type : DNA 
<211> Length : 4311

SequenceName : SEQ ID 524 
SequenceDescription :

Sequence 

<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString :

atggagtttc cggtgttgcc accggaaatc aactccgtgc tgatgtattc gggtgcgggg 60 

tcgagcccgt tgctggcggc ggccgcggcg tgggatgggc tggctgagga gttggggtcg 12 0 

20 gcggcggtgt cgtttgggca ggtgacgtcg ggcctgacgg cgggggtgtg gcagggtgcg 180 

gcggcggcgg cgatggcggc cgcggcggcg ccgtatgcgg ggtggttggg ttcggtggcg 240 

gccgcggccg aggcggtggc cgggcaggcg cgggtggtgg tgggggtctt tgaggcggcg 3 00 

ttggcggcga cggtggatcc ggcgctggtg gcggccaacc gggcgcggct ggtggcgttg 3 60 

gcggtgtcga atctgttggg gcagaacacg ccggcgatcg cggccgccga ggccgagtac 420 

25 gagctgatgt gggccgccga tgtggcggcg atggccggct accattccgg cgcgtcggct 480 

gctgccgcgg cgttgccggc gttcagccca ccggcgcagg cgctgggggg aggtgtcggc 540 

gcgttcctta ccgccctgtt cgccagccct gcgaaggcgc tgagcctgaa tgcgggtttg 600 

ggcaatgtcg gcaattacaa cgtcgggttg ggcaatgtcg gggtgttcaa cctgggcgcg 660 

ggcaatgtgg gtgggcagaa tctgggtttc gggaatgccg gtggcaccaa tgtcgggttc 720 

30 ggcaacctcg gtaacgggaa tgtcgggttc ggcaactccg gtctgggggc gggcctggcc 7 80 

ggcttgggca atatcgggtt gggcaatgcg ggcagcagca actatggttt cgcaaacctg 840 

ggtgtgggca acatcggttt cggcaacacc ggcaccaaca acgtcggcgt cgggctcacc 900 

ggcaaccacc tgacgggtat cgggggcctg aattcgggca ccgggaatat cgggttgttc 960 

aactccggca ccgggaatgt ggggttcttc aattcgggga ccgggaactt cggggtgttc 1020 

35 aactcgggta attacaacac cggtgtcggt aatgcgggga cggccagcac ggggttgttc 108 0 

aatgccggca atttcaacac cggcgtggtg aacgtgggca gttacaacac cggcagtttc 1140 

aacgccggcg acaccaacac cggtggcttc aaccccggcg gtgtgaacac cggctggctg 1200 

aacaccggca acaccaacac cggcatcgcc aactcgggca acgtcaacac cggcgcgttc 1260 

atctcgggca acttcaacaa cggcgtgctg tgggtgggtg actaccaggg cctgttcggc 1320 

40 gtctccgccg gctcgtcgat ccccgcaatt cccatcggcc tggtgctcaa cggcgacatc 1380 

ggcccgatca ccatccagcc catcccgatc ctgcccacca tcccgctcag cattcaccaa 1440 

accgtcaact tgggcccgct ggtggttccc gacatcgtga tccccgcctt cggcggcggt 1500 

atcggcatac ccatcaacat cggcccgctg accatcacac ccatcaccct gtttgcccaa 1560 

cagacatttg tcaaccaatt gccctttccc accttcagtt tagggaaaat cacaattcca 1620 

45 caaatccaaa cctttgattc taacggtcag cttgtcagct ttatcggccc tatcgttatc 168 0 

gacaccacca ttcccggacc caccaatcca cagattgatt taacgatcag atgggatacc 1740 

cctccgatca cgctgttccc gaatggcatc agtgctcccg ataatccttt ggggttgctg 1800 

gtgagtgtgt cgatcagtaa cccgggcttt accatcccgg gatttagtgt tcccgcgcag 1860 

ccgttgccgt tgtcgatcga tatcgagggc cagatcgacg ggttcagcac cccgccgatc 192 0 

50 acgatcgatc gcatccccct gaccgtgggg ggcggggtca cgatcggccc catcacgatc 1980 

cagggccttc atatcccggc ggcgccggga gtggggaaca ccaccacggc cccgtcgtcg 204 0 

ggattcttca actccggtgc gggtggggtg tcgggtttcg gcaacgtcgg cgcgggcagc 210 0 

tcgggctggt ggaaccaggc gccgagcgcg ctgttggggg ccggttcggg tgttggcaac 216 0 

gtgggcaccc tgggctcggg tgtgctcaac ctgggctcag ggatctcggg gttctacaac 222 0 

55 accagcgtgt tgcctttcgg gacaccggcg gcggtgtcgg gcatcggcaa cctgggccag 228 0 

cagctgtcgg gggtgtcggc ggcgggaacc acgctgcgct cgatgctcgc cggcaacctc 2340

gggttggcca atgtgggcaa cttcaacacc gggttcggaa atgtcgggga cgtcaacctg 240 0 

ggtgcggcca acatcggtgg gcacaacctg ggcctgggca atgtcgggga cggcaacctg 2460 

gggttgggca acatcggcca tggcaacctg gggtttgcca acttgggcct gaccgccggc 2520

60 gcggcggggg tgggcaatgt tggttttggc aatgccggca tcaacaacta tggcttggcg 2580 

aacatgggtg tgggcaatat tgggtttgcc aacaccggca cgggcaacat cgggatcggg 2640

ctggtcgggg accatcggac cgggatcggg ggcttgaact ccggcatcgg caatatcggg 2700

ttgttcaact ccggcaccgg caacgtcggg ttcttcaatt ccgggaccgg caacttcggc 2760

atcgggaact ccggccgctt caacaccggg atcggtaata gcggaacggc cagcaccggg 2820

ctcttcaatg ccggcagctt cagcaccggc atcgccaaca ctggtgacta caacacgggc 2880

agcttcaacg ccggcgacac caacaccggt ggcttcaacc cgggcggcat caacaccggc 2940

tggttcaaca ccgggcatgc caacaccggg ttggccaacg cgggcacctt cggcaccggc 3000 
gccttcatga cgggcgacta cagcaacggc
gtcggcgtcc gcgtcgggcc cacgatctcc 
ggggtgggcc cgctgcatgt ggcgcccgtc
gacgccaccg tcggcctggg tccgttcacc 
5 atcgccagca tcaccggaag cgtggacctg 
cttgacccgc tcgccggttc gatagggctt 
tttatcacca ttgatgcgtt ccaagttgtt 
gtgcccggcc tcacggttag cggtcagata 
ctcaacttgg acaccacccc gtggacgctt 

10 acccccgtga cggtgggtat ggaggtcgcc 
ctgacctttc cgcgggcctc cgccggggtc 
acgctgttgc ccgacgggtt caccctcgac 
ctcatcggcg atatcccgat cccgatcatc 
accaccacgg ccccatcgtc ggggttcttc 

15 gccaacgtcg gcgcgggcac gtcgggctgg 
gcgggctcgg gagttgccaa tgccggcacg
gggatctccg ggtggtacaa caccagcacc 
ggcatcggca acctcggcca gcagctgtcg 
cggagcccca ttgtcaatat cgggtgggcc 

20 aatgtggggg acctcaactg gggtgcggcc 
aatctcggca gcgggaacgt cgggttcggc 
aactcgggtc cggcggtggg cctggccggc 
agcaacaact gggggctggc caacctgggt 
acgggcaaca tcgggatcgg gctggtcggc 

25 tcgggtagtg gcaatatcgg attgttcaat 
accggcaccg gcaacttcgg actgttcaac 
agcggaaccg gcagtactgg gctcttcaat 
cccgggtcgt acaacacggg cagcttcaat 
ccgggcgaca tcaacaccgg ctggttcaac 

30 accggcgccc tcatgtcggg gaccgacagc 
ggcctgttcg gcctgtccta tggcatcacg 
actggcggta tcggccccat cgtcatcccg 
cagatcaccg gcgacgcgga ctacagcttc 
cacatcggca tcaatggcgt cgtcaccgtc 

35 tccgccctga agaataacgg tagcttcatc 
gatattccgc ccatggattt cacgttaggc 
ctcggaccaa ttcatcttga gccaatcgtg 
gagcccatcc ccctggatgc gatttcgttg 
gatattccgg cctcggtcat cgatgggatt 

40 tccgtggaca tcccggcggt cacgatcaca 
ttcgacattc gcaccagtgc cggacccctc 
ccgggcttcg ggaactcgac ccagatgccg 
ggcggatcgg gcatcggcaa cttgggtgcg 
gcggggtcac tggtggggac actctcgggg 

45 gtgctgaact ccggcaccgc catctccggg 
accccggcgg tgatctcggg gttcagcaac 
gatggcctga tcgcgatccfc caccttccca 
gacgcggcca tcgccgagct gcagcacctc 
gtcggcgggg tgaacctcgg tttggctaac 

50 gtcggcaaca tcaacgtcgg cgccggcaac 
gtcgggaccg gcaacctcgg gttcggcaac 
gcgggcctga ccgcgggcgc ggggggcctg 
ggcagctggg ggttggccaa cgtgggtgtg 
ggcaacatcg ggatcgggct gaccggggac 

55 ggcaccggga acctcgggtt gttcaactcg 
gggaccggga acttcgggct gttcaactcg 
ggcacggcca gcaccgggtt gttcaacgcg 
ggctcctaca acaccggcag cctcaacgtg 
ggcaccgtca acaccggctg gttcaacacc 

60 ggcaacgtca acaccggcgc gttcaactcc 
ggtgactacc acgggctggt cggcttctcc 
ctggacctca acgaaaccct caacctgggc 
ggcatgtcgc tgttcgacgt ccacgaaatc 
gtcgatgttc ccgcgatacc gctagagatc 

65 ctggtgcccg ccaccacaat tcccgcacag 
tcacccgggt caaccatgac gcttccgctc 
atcctcgggt cgaccgcggc gattcccaat 



ctgttgtggc ggggcggcta cgagggcctg 3 060 

caattcccgg tcaccgtgca cgcgatcggc 3120 

ccggtacccg ccgtgcacgt cgagatcacc 3180 

gtcccaccga tcagcattcc ctcacttccc 3240 

gccgcaaaca ccatctcgcc gattcgcgct 33 0 0 

tttctcgagc cgttccgcct cagtgaccca 33 60 

gccggtgtct tgttcctaga gaacatcatt 342 0 

ttggtcaccc cgacaccaat tcccctaacc 348 0 

ttcccgaatg gtttcaccat tcccgcgcaa 3 540 

aacgacgggt tcaccttctt cccgggtggg 360 0 

accggactgt ccgtggggct ggacgcgttc 3 660 

accgtgccgg cgaccttcga cggcaccatc 372 0 

gatgtgccgg cggtgccggg gttcggcaac 3 780 

aacaccggcg gcggcggtgg atcggggttc 3840 

tggaaccagg ggcacgacgt gttagcaggg 390 0 

ctgagctcgg gcgtgctgaa cgtcggctcg 3960 

ctgggagcgg gcaccccggc ggtggtctcg 402 0 

gggttcttgg caaatgggac cgtgctcaac 408 0 

gatgtgggcg cgttcaacac cgggttgggc 4140 

aacatcggcg cgcagaacct gggcctgggc 42 00 

aacatcggtg ccggcaacgt cgggttcgcc 42 60 

ctgggcaacg tggggttgag caatgccggc 4320 

gtgggcaaca tcgggttggc caacaccggc 43 8 0 

gactaccaga ccggcatcgg cggcctcaac 4440 

tccggcaccg gcaatgtcgg gttcttcaac 4500 

tccggtagtt tcaacaccgg catcggtaat 4560 

gccggcaatt tcaacaccgg catcgccaac 462 0 

gtcggtgata ccaacaccgg tggtttcaac 4680 

accggcatta tgaatacggg cacccgcaac 4740 

aacggcatgc tgtggcgcgg cgaccacgag 48 00 

atcccgcaat tcccgatccg catcaccacg 4860 

gacaccacga tccttccgcc gctgcacctg 492 0 

accgtgcccg acatccccat ccccgccatc 4980 

ggcttcaccg ccccggaagc caccctgctg 5040

agcttcggcc ccatcacgct ctcgaatatc 5100 

ctgcccgttc ttggtcctat cacgggccaa 5160 

gtggccggga tcggtgtgcc cctggagatc 5220 

agtgagtcga ttcctatccg catacctgtt 52 80 

tcaatgtcgg aagtggtgcc gatcgatgcg 5340 

ggcaccacca tttccgcgat cccgctgggc 540 0 

aacatcccga tcatcgacat cccggcggcg 5460 

tcgtcggggt tcttcaacac cggtgccggc 552 0 

ggcgtgtcgg gcctgctcaa ccaggccggc 5580 

ctgggcaatg ccggcaccct ggcctcgggt 5640 

ctgttcaacg tgagcacgct ggacgccacc 5700 

ctcggcgacc atatgtcggg ggtgtccatc 5760 

cctgccgagt ccgtgttcga tcagatcatc 5820 

gacatcggca acgctttggc cttgggcaat 5880 

gtcggtgagt tcaacctggg tgcgggcaac 5940 

ctcggcggca gcaacttggg gttgggcaac 6000 

atcggtgccg gcaatttcgg attcggcaac 60 60 

ggcaatgtgg ggttgggtaa cgccggcagc 6120 

ggcaatatcg ggttggccaa caccggcacc 6180 

tatcggaccg ggatcggcgg cctgaactcg 6240 

ggcaccggca acatcgggtt cttcaacacc 63 00 

ggcagttaca gcaccggtgt ggggaatgcg 63 60 

gggaacttca acaccggtct ggccaatgcc 6420 

ggcagcttca acaccggcgg cgtcaacccg 64 80 

ggccacacca acaccggcct gttcaacacc 6540 

ggcagcttca acaacggggc gctgtggacc 660 0 

ttcagcatcg acatcgccgg cagcaccctg 6660 

cccatccaca tcgagcagat cgacatcccc 672 0 

gtcgagatcg gacccttcac catcccgcag 6780 

cacgaatcga tccacatgga tcccatcgtc 6840 

acgagaacca ttccgctgga catccccgcc 6900 

atcagcatgc gcttcgaagg cgaggactgg 6960 

ttcggagacc ccttcccggc gcccacccag 7020 
ggcatcacca ttcacaccgg ccctggcccc
ccgggtttcg agattccgca aatcgctacc 
ggtggtctgc cggccttcac cttgttcgcg 
ccgttaacga tcgatgcgtc cggcgcgctg 
5 acgatcgacc cgctgccgct gcacctggcg 
ccgatcatcg atgtcccgcc gacgccaggg 
gggttcttca actccggcgc cggtggggtg 
tcgggctggt ggaaccaggc ggcgagcgcg 
gtcggcacgc tgggctcggg tgtgctcaac 

10 accagcgtgt tgccgctcgg gacgccggcg 
cagctgtcgg gcgtgtctgc ggccgggacc 
gggttggcgg atgtgggcaa cttcaacgtc 
ggcgcggcca acctcggtgc gcaaaacctg 
ggcttcgcca acgtcggcca cggcaatatc 

15 gcggccggcc tgggcaacac ggggttcggc 
aaccagggcg tgcgcaacat cgggttggcc 
ctggtggggg acaacctcac cggcatcggg 
ttgttcaact ccggcaccgg caacatcggg 
atcggtaact cgggcagctt caacaccggc 

20 ctcttcaatg ccggcagctt caacaccggc 
agcttcaatg ccggcgacac caacaccggg 
tggttcaaca ccggccacac caataccggc 
gcgttcatgt cgggcaactt cagcaacggc 
ttcagcctgt tctacagcct cgacgtgccc 

25 ggcggcttcg gacccgtggt cctcccgccc 
accggaaacg tcgcgatggg cgcattcacc 
ccaaacatca ccggaagcgc cgccttccgc 
gtgagtgtca ttgtggagca aataatcaac 
cccttcgaaa tgtggactca aggcactaat 

30 tcggccgacg gttcgcccta cgccaccggc 
ggaagccatc tcaccatttc cgcgtccagc 
ggcccgatca cgttgggctt ccaggtgccc 
ggtggtttga cgttcccggc gacctcgctg 
ggcgtggaca tcccggccat cacctggccc 

35 tatgtcctcg ccagcagcat cccgctgatc 
agcaccatca ccccgtcgtc gggcttcttc 
ggcaacttcg gcgcgggcac ctcgggctgg 
gcgggctcgg gttttgccaa cgttggcacg 
ggtgtctcgg ggatctacaa caccagcacg 

40 ggcctgggca acgtcggcca ccaactgtcg 
ccggtgaccg ttctgaatat cgggttggcc 
aatgtcgggg aggtcaacct gggcgcggcc 
aatatcggcg ccggcaacct ggggttcggc 
aactcgggtc tgaccgcggg cgtgccgggc 

45 ggcaacaact gggggttggc caacgtgggc 
accggcaaca ttgggatcgg gctgaccggc 
tccggtgccg gcaacctggg gttgttcaac 
accgggaccg gcaacttcgg gttgttcaac 
agcggaacgg gcagcactgg gctcttcaat 

50 gccggcagct acaacacggg cagcttcaat 
ccgggcagca tcaacaccgg ctggctcaac 
gcgggcaatg tcaacaccgg cgccttcgtc 
cgcggcgact accagggcct ggccggcttc 
gcggtgggcg ccgacgtcag cggcgggatc 

55 atcccgccca ttccggtcgg cttcgccgcg 
gacatctctg ttccatccat tcacttgggc 
accgtcaacc ccattaccgt caggaccccg 
gtcaccagca cgtccggacc aacctcagag 
atccggatcg cgccctctag cggcgggggt 

60 gggcccatct ccatcccctc cggcacggtg 
ccgatcgaca tcggcctgcc ggtgtcgctg 
accctgatcc ccaccctccc gctgggcctc 
atcccggcca tcgttctcga ccggatcttg 
ccgatcaacg tcccgatcgc cgggttcggc 

65 ctgccgtcgt cgggcttctt caacaccgga 
ggcgcgggca tgtcgggatt gctcaacgcg 
ggcttcgcca acttcggcac ccagctctcc 



ggaacgaccg gcgagctcaa gatatctatt 70 80 

acgagattcc tgttggacgt gaacatcagc 7140

ggtggcctga cgatccccac gaacgccatc 72 00 

gatccgatca cgattttccc gggtgggtac 72 60 

ctgaatctca ccgtgcccga cagcagcatc 73 20 

ttcggcaaca ccacggcgac cccgtcgtcg 73 80 

tcggggttcg gaaacgtcgg gtcgaacctg 7440 

ctggcggggt cgggatcggg ggtgttgaat 7500 

gtcggctcgg gtgtctcggg gatctacaac 75 60 

gtgctgtcgg gcctcggcaa cgtcggccat 7620 

gcgttgaacc agatccccat cctcaacatc 768 0 

gggttcggca acgtcgggga cgttaacctg 7740 

gggctgggca acgtcggcac cggcaacctc 78 00 
ggtttcggca attcgggtct gaccgccggc 7860

aatgccggca gcgccaacta tggtttcgcc 792 0 

aacaccggca ccggcaacat cgggatcggg 7980 

ggcctgaact ccggtgccgg caatatcggc 8040 

ttcttcaact ccgggaccgg caacttcggc 810 0 

atcggcaata gcggaacggg cagcactggg 8160 

gtggccaacg ccggcagcta caacaccggc 822 0 

gggttcaacc cgggcaccat caacaccggc 82 8 0 

atcgccaact cgggcaacgt cggcaccggc 8340 

ctgttgtggc ggggtgatca cgagggcctg 8400 

cggatcacca tcgtggacgc ccacctcgac 8460 

atcccggtgc cggccgttaa tgcgcacctg 852 0 

attccgcaga tcgacatccc cgcactcacc 8580 

atcgttgtgg ggtccgtgcg cattccgccg 8640 

gcctcggttg gggcggagat gaggatagat 87 0 0 

ggccttggta taaccttcta ttcattcgga 8760 

ccactcgttt tcggcgccgg cacgagcgac 8820 

ggggcgttta ccactccgca gctcgaaact 88 8 0 

ggcagcgtca acgcgatcac cctcttcccc 8940 

ctgaacctgg acgtgaccgc cggcgccggc 9000 

gagatcgcgg cgagcgccga cggctcggtg 9060 

aacatcccgc ccaccccggg cattgggaac 9120 

aacgccggcg cgggcggggg atcgggcttc 9180 

tggaaccagg cgcacaccgc gctggcgggg 9240 

ctgcattccg gtgtgctcaa cctgggctcg 93 00 

ctgggggtgg ggaccccggc gctggtctca 93 60 

gggctgcttt ccggcgggtc cgcggtgaac 942 0 

aacgtcggca gccacaacgc cggtttcggc 948 0 

aacctcggcg cgcacaacct gggcttcgga 9540 

aatattggcc acggcaatgt cggagtcggc 9600 

ctgggcaatg tggggttggg caatgccggc 9660 

gtgggcaata tcgggttggc caacaccggc 972 0 

gactaccaga ccggcatcgg cggcctaaat 978 0 

tccggcgccg gcaacgtcgg gttcttcaac 9840 

tccggcagct tcaacaccgg cgtcggcaat 99 00 

gccggcagtt tcaacaccgg tgtggccaac 9960 

gtcggtgaca ccaacaccgg gggcttcaac 10020 

gccggcaacg ccaacaccgg ggtggccaac 10080 

accggcaact tcagcaacgg catcctgtgg 10140 

gccgtgggct acaccctccc gctgttcccc 102 00 

ggcccgatta ccgtgctgcc gcccatccac 102 60 

gtcggtggca tcggcccgat cgccatcccg 10320 

ctcgaccccg ccgtccatgt cggctccatc 10380 

cccgtgctcg tcagttactc ccaaggagcc 10440 

atttgggtca agcccagctt cttccccgga 10500 

gcaacgtcca cgcaaggggc atactttgtg 10560 

accttcccgg gattcaccat ccccctcgac 10620 

accatcccgg ggttcaccat cccgggcggc 10680 

gcgttgtcca atggcatccc gcccgtcgac 10740 

ctggacctgc acgccgacac cactatcggc 108 00 

ggggcgccgg gtttcgggaa ctcgaccacg 108 60 

gctggcggcg gttcgggctt tagcaacacc 1092 0 

atgtcggatc cgctgctcgg gtcggcgtcg 10980 

ggcatcctca accgcggcgc cggcatctcg 11040 
ggcgtgtaca acaccggcgc gctgggtgtt
aacgtcggcc agcaactgtc gggcttgctc 

<212> Type : DNA 
<211> Length : 11151

SequenceName : SEQ ID 525 
SequenceDescription : 



gtcaccgcgg ccgtcgtctc gggtttcggc 11100 
ttcaccggcg tcgggcccta a 11151 



Sequence 

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

ttgaattttc cagttctgcc accggaaatc aactccgtgc tgatgtattc gggtgcgggg 60 

tcgagcccgt tgctggcggc ggccgcggcg tgggatgggc tggctgagga gttggggtcg 120

15 gcggcggtgt cgtttgggca ggtgacgtcg ggcctgacgg cgggggtgtg gcagggtgcg 180 

gcggcggcgg cgatggcggc cgcggcggcc ccgtatgcgg ggtggttggg ttcggtggcg 240 

gcgcaggccg tggcggtggc cgggcaggcg cgggccgcgg tggcggcgtt tgaggcggcg 3 00 

ttggcggcga cggtggatcc ggcggcggtg gcggtcaacc ggatggcgat gcgggcgttg 3 60 

gcgatgtcga acctgctggg gcagaacgcc gcagcgatcg cggccgtcga ggccgagtac 420 

20 gagttgatgt gggccgccga tgtggcggcg atggccggct accattccgg cgcgtcggct 480 

gctgccgcgg cgttgccggc gttcagccca ccggcgcagg cgttgggggg tggtgtcggc 540 

gcgttcctca atgctctatt tgccggaccc gcgaagatgt tgaggcttaa cgcgggcttg 600 

ggcaatgtcg gtaattacaa cgtcgggttg ggcaatgtcg ggatattcaa cctgggcgca 660 

gccaatgtcg gtgcgcagaa tttgggtgct gccaacgccg gtagcgggaa tttcggtttc 72 0 

ggcaatatcg gcaacgccaa cttcgggttc ggcaactcgg gtcttgggtt gccgccgggc 780

atgggcaata ttgggttggg caatgcgggc agcagcaact acggcctcgc aaacctgggt 840 

gtgggcaaca tcggttttgc caacacgggt agcaacaaca tcgggatcgg gttgaccggg 900 

gacaacctga ctggcattgg gggcctgaat tcaggaaccg gtaatctggg gttgttcaac 960 

tccggcaccg gcaacattgg gttcttcaat tcggggaccg gcaacttcgg ggtattcaac 1020 

30 tcgggcagct acaacaccgg tgtcggtaat gcggggacgg ccagtaccgg gttgttcaac 1080 

gttggtgggt tcaacacggg tgtggccaac gtgggtagct ataacacggg cagcttcaac 1140 

gcgggcaaca ccaatacggg tggcttcaac ccgggcaacg tcaacaccgg ctggctgaac 1200 

accggcaaca ccaacaccgg catcgccaac tcgggcaatg tcaacaccgg cgcgttcatc 1260 

tcgggcaact tcagcaacgg tgtgctgtgg cggggtgact acgagggcct gtgggggctc 132 0 

tccggtggat cgaccattcc ggcgatcccc attggtctcg agctcaacgg cggcgtcggc 1380

cccatcaccg tgttgccgat ccagattttg cccaccatcc cgctcaacat tcaccaaacc 1440 

ttcagcctcg gcccgctggt cgttccggac atcgtgatcc ccgcttttgg tggcggtacg 1500 

gccataccta tcagcgtcgg ccccatcacc atctcgccca tcaccctgtt cccggctcag 1560 

aacttcaaca cgactttccc cgtcggcccc ttctttggct tgggggtcgt caacatttca 1620 

40 ggaatcgaaa tcaaagatct tgccggcaac gtcaccctcc aattaggtaa ccttaatatc 168 0 

gacaccagaa ttaaccagtc attcccggtg accgtcaact ggagtacccc ggcagtaacg 1740 

atcttcccga atggcatcag tattcccaac aatccactgg cgctgctggc cagcgcgtcg 18 00 

atcggcacgc tgggattcac gatcccgggc ttcaccattc ccgctgcgcc gctgccgctg 1860 

acgatcgaca tagacggcca gattgacggc ttcagcaccc cgccgatcac gatcgaccgc 192 0 

45 atcccgctga acctcggcgc cagcgtcact gtcggcccta tcctgatcaa cggcgttaat 1980 

atcccggcga ccccgggctt tggcaacacg accaccgctc cgtcgtcggg tttcttcaac 2040 

tccggcgacg gtggggtgtc gggcttcggg aatttcggtg cgggcagctc gggttggtgg 2100 

aaccaggcgc agaccgaggt ggctggggcg ggttcgggtt tcgccaattt cggttcgctg 2160 

ggatcgggtg tgctgaactt cggctcgggt gtgtcggggc tgtacaacac cggcgggttg 2220 

50 ccgccgggga ccccggcggt ggtctcgggc atcggcaatg ttggtgagca gctgtcgggg 22 80 

ttgtcctcgg cggggacggc actcaaccag agcctcatca tcaatctcgg gttggccgat 2340 

gtgggcagcg taaacgtcgg tttcggcaac gtcggggact tcaacctggg tgcggccaat 2400 

atcggcgact tgaacgtggg tttgggcaat gtcggcggcg gcaacgtcgg gttcggcaat 2460 

atcggcgatg ccaacttcgg gttgggcaat gcgggtctgg cggcgggcct ggccggggtg 2520 

55 ggcaacatcg ggttgggcaa tgccggcagc ggcaacgtcg gcttcggcaa catgggtgtg 2580 

ggcaacatcg ggttcggtaa caccggcacc aacaacctcg ggattgggct gaccggggac 2640 

aaccagactg ggatcggcgg cttgaactcc ggtgccggca acatcgggtt gttcaactcc 27 0 0 

ggcaccggca acgtcgggtt gttcaactcc gggaccggga acttcgggtt gttcaactcg 2760 

ggcagcttca acaccggcat cggcaatggc ggaacgggca gtactgggct tttcaatgcc 2820 

60 ggtaatttca ataccggtgt ggccaaccct gggtcgtaca acacgggcag cttcaatgtg 28 80 

ggtgacacca acaccggtgg tttcaacccg ggcagcatca acaccggctg gttcaacacc 2940 

ggcaacgcca acaccggcgt cgccaattcg ggcaatgtcg acaccggcgc cctcatgtcg 3 000 

ggcaacttca gcaacggcat cttgtggcga ggcaacttcg agggcctgtt cggcctgaac 3 060 

gtcggcatca cgattcccga attcccgatc cactggactt caaccggcgg catcggcccc 312 0 

65 attatcatcc cggacaccac gatccttccc cccatccacc tgggcctcac gggacaagcg 3180 

aactacggct tcgccgtgcc ggacatcccc attccggcaa tccacatcga cttcgacggt 3240 

gccgccgacg ccggcttcac cgccccggcc accaccctgc tttctgcgct gggcattacc 33 0 0 
ggacaattca ggttcggccc gatcaccgtc tcaaacgtcc agctcaatcc gttcaacgtt 3360

aacctcaagc ttcagttcct ccacgacgcg ttcccaaatg aatttcccga tcccacaatc 3420 

tcggttcaga tacaggtcgc catacccctt acttcggcaa cgctgggcgg attggccctg 348 0 

ccgctgcagc agaccatcga cgccatcgaa ttgccggcaa tctcgttcag ccaatccata 3540 

5 cccatcgaca ttccgccgat cgacatcccg gcctccacta tcaacggaat ttcgatgtcg 3 60 0 

gaggtcgtgc cgatcgatgfc gtccgtcgac attccggcgg tcaccatcac cggcaccagg 3660 

atcgacccga ttccgctgaa cttcgacgtt ctcagcagcg ccggacccat caacatctcg 372 0 

atcatcgaca ttccggcgct gccgggcttt ggcaactcga ccgagctgcc gtcgtcgggc 378 0 

ttcttcaaca ccggcggcgg tggcggctcg ggcatcgcca acttcggcgc gggggtgtcc 3840 

10 ggcttgctga accaggcctc gagtccgatg gtggggacgc tctccggcct gggcaatgcc 3900 

ggcagcctgg catccggtgt gctgaactcc ggcgtcgaca tctcgggcat gttcaacgtg 3960 

agcacgctgg gctccgcgcc ggcggtgatc tcgggtttcg gcaacctggg caaccacgtg 402 0 

tcgggggtgt ccatcgatgg cctgctggcg atgctgacca gcggcgggtc gggcggctcc 408 0 
gggcagccga gcatgatcga cgcggcgatc gccgagctgc ggcacctgaa tccgctgaac 4140

15 atcgtcaacc tgggcaacgt cggcagctac aacctcggct tcgccaacgt cggcgacgtc 4200 

aacctgggcg cgggcaacct cggcaacctc aacctcggcg gtggcaacct cggcgggcag 42 60 

aacctggggt tgggcaacct cggggacggc aacgtcgggt tcggcaacct cggccacggc 43 20 

aatgtcgggt tcggcaactc gggcctgggg gcgctgccgg ggatcggcaa catcgggttg 43 8 0 

ggcaacgccg gcagcaacaa cgtcggcttc ggcaacatgg gcctgggcaa catcgggttc 4440 

20 ggcaataccg gcaccaacaa cctcgggatc gggctgaccg gcgacaacca gaccgggttc 4500 

ggcggcctga actccggtgc cggcaacctg gggttgttca actccggcac cggcaacatc 4560 

gggttcttca acaccgggac cggaaactgg gggttgttca actcgggcag ctacaacacc 4620 

ggcatcggta acagcggaac gggcagtacc gggcttttca atgccgggag tttcaacacg 468 0 

ggtctggcca atgccggtag ttacaacacc ggcagcctca acgcgggcaa caccaacacc 4740 

25 ggcggcttca accctggcaa tgtcaacacc ggctggttca acgccggcca caccaacacc 480 0 

ggcggcttca acacgggcaa tgtcaacacc ggcgcgttca actccggcag cttcaacaac 4860 

ggcgcgctgt ggaccggtga tcaccacggg ctggtcggct tctcctacag catcgaaatc 4920 

accggcagca ccctggtgga catcaacgaa accctcaacc tcggtcccgt ccacatcgat 498 0 

cagatcgata ttcccggcat gtcgctgttc gacatccacg aactcgtcaa catcgggccc 504 0 

30 ttcaggatcg agcccatcga tgtccccgca gtggtgctgg acatccacga aacgatggtc 5100 

atcccgccca tcgtcttcct gccgagcatg acgatcggcg gtcagaccta cacgattccg 5160 

ctcgacacgc ccccggcccc cgccccgccg cccttcagac ttccgttgct gttcgtgaat 5220 

gcgctcggcg acaactggat cgttggggcg tccaactcaa ccggaatgag tggtggcttt 5280

gtcaccgcac ccactcaggg catcctgatc cataccggtc ccagcagcgc aaccaccggt 5340 

agcctcgcac taaccctccc aaccgtcacc atcccaacga tcacgacatc gcctatcccg 5400

ctcaagatcg atgtgtcggg cggtcttccg gccttcacgc tgttccccgg tggcctcaac 5460 

atcccgcaaa atgcgatccc gttgaccatc gatgcgtccg gcgtgctgga tccgatcacg 5520 

atattcccgg gtggtttcac gatcgatccg ctgccactga gcctggccct caacatcagc 5580 

gtgccggaca gcagcgttcc gatcatcatc gttccgccga cgcccggctt cgggaacgcg 5640 

accgccaccc cgtcgtcggg tttcttcaac tccggcgcgg gcggggtgtc gggtttcggc 5700

aacttcgggg ccggcagctc aggctggtgg aaccaggcgc atgccgcgtt ggcgggcgcg 5760 

ggctcgggcg ttctcaacgt tggcacgctg aactcgggtg tgctgaacgt cggctcgggg 5820 

atatcggggc tgtacaacac cgctatcgtg ggtttgggga cgccggcgct ggtgtcgggt 5880 

gccggcaacg tgggccagca gctgtcgggg gtgttggcgg ccgggacggc gttgacccaa 5940 

agccccatca tcaacctcgg gttggccgat gtcggcaact acaacctcgg gttgggcaac 6000

gttggggact tcaacctggg cgcggccaac ctcggcgacc tcaacctagg gttgggcaat 6060 

atcgggaacg ccaacgtcgg cttcggcaat atcggccacg gcaacgtcgg gtttggcaat 6120

tcgggcctgg gggcggcgct cggcatcggc aatatcgggt tgggcaatgc gggcagcacc 6180

aacgttggcc tggccaacat gggtgtgggc aacatcgggt tcgccaacac cggcaccaac 6240 

aacctcggga ttgggctgac cggcgacaac cagaccggca tcggcggctt gaactccggt 6300

gccggcaaca ttggcctgtt caactccggc accggcaaca tcgggttctt caactccggg 6360

accggaaact gggggttgtt caactcgggc agcttcaaca ccggcatcgg taatagcgga 6420

acgggcagta ctgggctttt caatgccggt ggtttcacta cgggtctggc caacgccggg 6480

tcgtacaaca cgggcagctt caatgtcggt gacaccaaca ccggtggctt caacccgggc 6540

agcatcaaca ccggctggtt caacaccggt aacgccaaca ccggcatcgc gaactcgggc 6600

aatgtcgaca ccggcgccct catgtcgggc aacttcagca acggcatcct gtggcggggc 6660 

aactacgaag gcctattcag ctattcctac agcctcgacg ttccccggat caccatcctg 6720

gacgcgcatt tcaccggggc cttcggcccg gtggtcgtcc cgcccatccc ggttctggcg 6780 

atcaacgcgc acctgaccgg caacgcggcg atgggcgcct tcaccatccc gcaaatcgat 6840 

attcccgccc tcaatccgaa cgtcaccgga agcgtcggct tcggccccat cgcggtcccc 6900

tcggtcacca ttcccgccct gaccgccgca cgagcggtcc tcgatatggc cgcgtcggtc 6960 

ggggcgacca gcgaaataga gccgtttatc gtctggacgt catccggtgc gatcggccca 7020

acgtggtact cggtcggcag aatctacaac gccggtgacc tgttcgtcgg cggcaatatc 7080

atctcgggaa tcccgacgct cagcacgacc ggcccggtgc atgccgtctt caatgcggca 7140 

tctcaggcgt tcaacacccc ggcgctcaat attcaccaga tcccgttggg tttccaggtg 7200

ccgggcagca tcgacgcgat caccctgttc cccggtggtc tgacgttccc ggcgaactcg 7260 

ctgctgaacc tggatgtgtt cgtcggcacc cccggcgcca ccattccggc gatcacgttc 7320 
ccggagatcc cggcgaacgc cgacggcgaa ctctacgtca tcgccggcga catcccgctg 7380

atcaacatcc cgcccacccc gggcattggg aacaccacca ccgttccgtc gtcgggcttc 7440 

ttcaacaccg gggcgggcgg gggctcgggt ttcggcaact tcggcgcgaa catgtcgggg 7500 

tggtggaacc aggcgcacac cgctttggca ggcgcgggtt cgggtattgc caacgtcggc 7560

acactgcact ccggcgtgct caacctcggt tcggggctgt cggggatcta caacaccagc 7620

acgctgccgc ttgggacgcc ggcgttggtg tcgggcctgg gcaacgtcgg tgatcacctg 7680

tcgggcttgt tggcctccaa cgtggggcaa aaccccatca ccatcgtcaa catcgggttg 7740 

gctaacgtcg gcaacggcaa cgtcggcctc ggcaacatcg gcaacctcaa cctgggtgcg 7800

gccaacattg gcgacgtgaa cctgggattc ggcaacattg gcgacgtgaa cctgggcttc 7860 

ggcaacatcg gcggcggcaa cgtcgggttc ggcaatatcg gcgatgccaa cttcgggttc 7920

gggaattcgg gtctggcggc gggcctggcc ggcatgggca atatcgggct gggcaacgcc 7980

ggcagcggca acgtcggctg ggccaacatg ggcctgggca acatcgggtt tggcaacacc 8040 

ggcaccaaca acctcgggat cgggctcacc ggcgacaacc agtccggcat cggcggcttg 8100 

aactccggca ctggcaacat tggcctgttc aactccggca caggcaatat cggcttcttc 8160

aactccggga ctgccaactt cgggttgttc aactccggca gctacaacac cggtatcggc 8220

aactccgggg tggccagcac cgggttggtc aacgccggcg gcttcaacac cggtgtggca 8280

aacgccgggt cgtacaacac gggcagcttc aatgctggtg acaccaacac cggtggcttc 8340

aacccaggca gcaccaacac cggctggttc aacaccggta acgccaacac cggcgtcgcc 8400

aacgcgggca atgtcaacac cggcgccctc atcacgggca actttagcaa cggcatctta 8460

tggcggggca attacgaggg cttggccggc ttctccttcg ggtaccccat tccgctgttc 8520

cccgcggtgg gcgccgacgt caccggcgac atcggccccg ccaccatcat tccgcccatc 8580

cacatcccgt ccatcccgtt gggcttcgcc gcgatcggcc acatcgggcc gatcagcatc 8640 

ccgaacatcg ccatcccctc gatccacctg ggcatcgatc ccaccttcga cgtcggccct 8700

atcaccgtgg accccatcac cctcaccatc cctggcctaa gtttggatgc tgccgtctcg 8760 

gagatcagga tgacgtccgg aagcagctcc ggattcaagg tcagacccag cttttcattc 8820

ttcgcggtcg gacccgacgg catgcccggg ggcgaggtct ccatacttca accattcacc 8880

gtggcaccca tcaacttgaa cccgacgaca ctgcacttcc ccggattcac cattcccacc 8940 

ggacccatcc acatcggcct gccgctgtcg ctgaccattc cgggcttcac catcccgggc 9000

ggcaccctga ttccccaact cccgctgggc ctcggtttgt ccggcggcac cccacccttt 9060

gatctcccga cggtcgttat cgaccggafcc ccggtggagt tacacgccag caccaccatc 9120

ggccccgtca gcctcccgat tttcgggttc ggcggagcac cgggctttgg caacgacacc 9180

accgcgccgt cgtcgggctt cttcaacacc ggcggtggtg gcgggtccgg cttctccaac 9240 

tccgggtcgg gcatgtcggg ggtgctcaac gcgatctcgg atccgctgct cgggtcggcg 9300

tcgggcttcg ccaatttcgg cacccagctc tccggcatcc tcaaccgtgg cgcgggcatc 9360

tcgggcgtgt acaacacggg cacgcttggc ctggtcacat cggccttcgt ctcgggcttt 9420

atgaacgtcg gccagcagct gtcgggcctg ctgttcgcgg gcaccgggcc gtaa 9474 



<212> Type : DNA 
<211> Length : 9474 

SequenceName : SEQ ID 526 

SequenceDescription : 



Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgagttttg tcgtaatgcc gccagagatc aactccctgc tgatatatac cggggcgggt 60
ccgggccccc tgttggcggc ggccgcggcc tgggatgagc tggccgccga gctgggctcg 120
gcggcggcgg ctttcgggtc ggtgacctcg gggctggtcg gtggtatctg gcaggggccg 180
tcctcggtcg cgatggcggc ggcggccgcc ccgtatgcgg ggtggttgag cgcggcggcg 240
gcctccgccg agtcggcggc cgggcaggcc cgggcggtgg tgggtgtgtt cgaggcggcg 300
ttggccgaga cggtggaccc cttcgtcatc gcggccaatc ggtcgaggtt agtgtcgctg 360
gcgttatcga atctgttcgg ccagaacacg ccggcgatcg cggccgcgga gttcgactac 420
gagctgatgt gggcccagga cgtggccgcg atgctggg^t accacaccgg cgcctcggcg 480
gcggccgagg cgttggcgcc gtttggctcg ccgctggcaa gcctggcggc cgccgccgag 540
ccggccaagt cgctcgccgt caatctgggt ttggccaacg ttggcctctt caacgcggga 600
agcggcaacg tcggcagcta caacgtgggg gccggcaacg tcgggagcta caacgtgggc 660
ggcggcaata tcggtggcaa caatgtcggg ttgggcaatg tcgggtgggg caactttggg 720
ctcgggaatt cggggttaac gccgggtctg atgggtctgg gtaatatcgg gtttggtaat 780
gccggcagct acaatttcgg tttggcgaat atgggtgtgg gcaatattgg gttcgctaac 840
accggcagtg ggaatttcgg fcattgggttg accggtgata atctgaccgg gttcggtggt 900
ttcaataccg gtagcgggaa tgtggggttg tttaattcgg ggaccggtaa tgtggggttc 960
tttaactccg gcaccgggaa ctggggggtg ttcaattcgg ggagctataa caccgggatc 1020
ggtaattcgg ggattgccag cacggggttg ttcaacgcgg gtgggttcaa tacgggtgtg 1080
gtcaatgcgg gtagctacaa caccggcagt ttcaacgcgg gggaggccaa tacgggcggt 1140
ttcaacccgg gcagtgtcaa cacgggttgg ttgaacaccg gtgacatcaa caccggggtg 1200
gccaactccg gcgacgtcaa caccggtgcc ttcatctccg gtaactacag caacggcgtc 1260
ttgtggcggg gcgactacca gggcctgctc ggcttctcct cgggagcgaa cgttcttcct 1320
gtcattcccc tcagcctgga cataaacggt ggtgtcggcg ccatcactat cgagcccatc 1380
cacatacttc ccgacatccc gatcaacatc aacgaaaccc tctacctcgg acccttggtc 1440
gtcccgccca tcaacgtccc ggcaatctcg ctcggcgtcg gcatacccaa tatttcgatc 1500
ggccccatca aaatcaatcc catcaccctg tggccggcac aaaacttcaa ccaaaccatc 1560
acgctggcct ggcccgtctc gtccataaca attccccaaa tccaacaggt cgccctcagc 1620
ccttccccca ttcccacaac cttgatcggc ccaatacata tcaataccgg gttttccatc 1680
ccagtaacct tcagttattc caccccagcc ctcacccttt tcccggttgg cctcagcatt 1740
cccaccgggg ggccgctcac cctgactctt ggcgtaacgg caggcaccga ggccttcacc 1800
attccgggat tctcgattcc cgagcaacca cttccgctgg cgatcaacgt gattggccac 1860
atcaacgccc tgagcacccc ggcaatcacc atcgacaaca tccccctgaa cctgcacgcc 1920
atcggcggcg tcgggcccgt cgacatcgtg ggcggcaacg ttccggcgtc gccggggttc 1980
gggaattcga ccaccgcccc gtcctcgggc ttctttaaca ccggcgccgg cggggtgtcg 2040
ggattcggga acgtcggcgc gcacacctcg ggctggttta accagtccac ccaggccatg 2100
caggtgttgc cgggaacggt ttcgggctat ttcaactccg gcacgttgat gtcgggtatc 2160
ggcaatgtag gtactcagtt gtcggggatg ttgtcgggcg gcgcgctggg cggcaacaac 2220
ttcggattgg gcaatatcgg gttcgacaac gtcgggttcg gcaatgccgg cagcagcaat 2280
ttcggcctgg cgaacatggg catcggcaac atcgggttgg ccaataccgg caacgggaat 2340
atcggtatcg ggctcagcgg ggacaatctg actgggttcg gtggtttcaa ttcgggcagc 2400
gagaatgtgg gattgttcaa ttcgggtacc ggcaatgtcg ggttcttcaa ctccggcacc 2460
gggaatttgg gcgtattcaa ttcgggaagc cacaacaccg gtttcttctt aacgggcaac 2520
aacatcaacg tcctggcccc cttcaccccc ggcacccttt tcaccatttc cgaaatcccc 2580
attgacctgc aggtgattgg cgggatcggt cccatccacg tccagcccat cgatattccg 2640
gcattcgaca tacaaatcac cgggggattc atcgggatcc gcgaattcac tctcccggag 2700
atcaccattc ccgcgatccc gatccacgtg accggaaccg tcggcctcga gggttttcac 2760
gtgaatcccg cgtttgttct tttcgggcaa accgccatgg cagaaatcac cgcggacccg 2820
gtcgtcttgc cggacccgtt catcacgatc gaccactacg gcccgccgct aggcccaccc 2880
ggcgcgaaat tcccctccgg gtcgttctac ctcagcatca gcgacctgca gatcaatgga 2940
cccatcatcg gcagctacgg cggcccgggc acgatccccg gcccgttcgg cgcgactttc 3000
aatctgtcca catcgtcctt ggccttgttc ccggccggcc tcaccgttcc cgaccaaaca 3060
ccggtgaccg tcaacctgac cggcggcctg gacagcatca cgctattccc gggcggcctg 3120
gcgttcccgg aaaaccccgt ggtcagcctc accaacttct ccgtcggcac cggtggattc 3180
accgtgttcc cgcagggttt cacggttgac cgcatcccgg tggacctgca caccaccctt 3240
tctattggcc ctttcccgtt ccggtgggac tacatcccgc caaccccggc caacggcccc 3300
attccggcgg tcccaggcgg ctttggcctc accagcgggc tgtttccatt tcacttcacc 3360
ctcaacggcg gcatcggccc catcagcatc ccgaccacca ccgttgtgga tgcgctaaat 3420
cccctcctga ccgtcacggg aaacctcgag gtcggcccct ttaccgtccc ggacatcccc 3480
atccccgcca tcaattttgg cctcgacgga aacgtcaatg tgagttttaa cgccccggcc 3540
actaccctgc tgtccggcct gggcatcacg ggaagcatcg atatctccgg aatccaaatc 3600
acgaatatac agacccagcc cgcccagctg ttcatgtcgg tcggtcagac tcttttcttg 3660
tttgacttca gagacggcat cgaactgaat ccgattgtaa ttccgggaag ttctattccc 3720
atcaccatgg cgggactcag tatcccgttg cccactgtca gtgaatccat accgctcaac 3780
ttctctttcg gttccccggc atccaccgta aagtcaatga ttctgcacga aatcctgcct 3840
atcgacgtgt ccatcaacct cgaggacgcc gtcttcatcc cggccaccgt gctgcccgca 3900
attccgctga acgtggacgt cacaatcccg gtgggcccca tcaacatccc gatcatcacc 3960
gagccgggct cagggaactc caccaccacc acgtcggatc ccttcagcgg tttggctgtc 4020
ccgggcctgg gtgtgggact gttgggcctg ttcgacggga gcatcgccaa caacctgatc 4080
tcgggcttca attccgcggt cgggatcgtc ggcccgaacg tggggttgag caacctcggt 4140
ggcggcaatg tcgggttggg caatgtcggg gactttaacc tgggtgcggg caacgtcggt 4200
gggttcaatg tgggtggcgg caatatcggc ggcaacaatg tcgggttggg caatgtcggg 4260
tttggcaatg tcgggttggc gaattcgggg ttaacgccgg gtctgatggg tttgggtaat 4320
atcgggtttg gtaatgccgg cagctacaat ttcggtttgg cgaatatggg tgtgggcaat 4380
attgggttcg ctaacaccgg cagtgggaat ttcggtattg ggttgaccgg tgataatctg 4440
accgggttcg gtggtttcaa taccggcagc gggaatgtgg ggttgtttaa ttcggggacc 4500
ggtaatgtgg ggttctttaa ctctggcacc gggaactggg gggtgttcaa ttcggggagt 4560
tataataccg ggatcggtaa ttcggggatt gccagcacgg ggttgttcaa cgcgggtggg 4620
ttcaacacgg gtgtggtcaa tgcgggtagc tacaacaccg gcagtttcaa cgcggggcag 4680
gccaatacgg gcggtttcaa cccgggcagt gtcaacacgg gttggttgaa caccggtgac 4740
atcaacaccg gggtggccaa ctccggcgac gtcaacaccg gcgccttcat ctccggtaac 4800
tacagcaacg gcgccttctg gcggggcgac taccagggcc tgctcggctt ctcctaccgc 4860
cccgccgtgc ttccccaaac gccgttcctg gacctcaccc tcaccggcgg actgggctcc 4920
gtcgttatcc ccgccatcga cattcccgcg atccgccccg agttcagcgc caacgtcgcc 4980
atcgacagct tcactgtgcc gagcatcccg attccccaga tcgacctggc cgccaccacg 5040
gtcagcgtcg gcctcggccc catcaccgtc ccgcacctcg atattccacg ggtgcccgtc 5100
acgctgaatt acttgtttgg ctcacaaccc ggcgggcccc tgaaaatcgg tccgattacg 5160
ggactcttca acacccccat cggccttacc cccctggcgt taagccagat agtcatcggt 5220
gctagctcgt cgcaagggac catcacggct ttcctggcca acctgccgtt cagcaccccc 5280
gtcgtcacca ttgacgagat cccgctgctg gccagcatta ccggccacag cgagcccgtc 5340

gacatcttcc cgggcggcct cacgatcccc gcgatgaacc cgctgagcat caacctgtcc 5400 

ggtggcaccg gcgccgtcac cattccggca atcaccatcg gcgaaatccc ctttgacctc 5460

gtggcccaca gcacgctcgg ccccgttcac atcctcatcg accttcccgc cgtgccgggg 5520 

ttcgggaata cgaccggtgc tccgtcgtcg ggtttcttca actccggtgc gggtggggtg 5580

tcggggtttg ggaatgtcgg cgcgatggtg tcgggtggct ggaatcaggc tccgtcggcg 5640

ttgctgggtg ggggttcggg tgttttcaac gccggcacgc tgcattcggg tgtgctgaat 5700

ttcggctctg gcatgtcggg gctgttcaac accagcgtgt tggggttggg tgcgccggcg 5760 

ttggtgtcgg gtttgggtag tgtcggtcag cagttgtcgg gattgttggc gagcgggacg 5820

gcgctgcatc agggtctggt cctcaatttc gggttggccg atgtggggtt gggcaatgtc 5880

gggttgggca atgtcgggga ctttaacctg ggtgcgggca acgtcggtgg gttcaatgtg 5940 

ggtggcggca atatcggcgg caacaatgtc gggttgggca atgtcgggtg gggcaacttt 6000

gggctcggga attcggggtt aacgccgggt ctgatgggtt tgggtaatat cgggtttggt 6060

aatgccggca gctacaattt cggtttggcg aatatgggtg tgggaaatat tgggttcgct 6120

aacaccggca gtgggaattt cggtattggg ttgaccggtg ataatctgac cgggttcggt 6180

ggtttcaata ccggcagcgg gaatgtgggg ttgtttaatt cggggaccgg taatgtgggg 6240 

ttctttaact ctggcaccgg gaactggggg gtgttcaatt cggggagtta taacaccggg 6300

atcggtaatt cggggattgc cagcacgggg ttgttcaacg cgggtgggtt caacacgggt 6360

gtggtcaatg cgggtagcta caacaccggc agtttcaacg cggggcaggc caatacgggc 6420 

ggtttcaacc cgggcagtgt caacacgggt tggttgaaca ccggtgacat caacaccggg 6480

gtggccaact ccggcgacgt caacaccggc gccttcatct ccggcaacta cagcaacggc 6540 

gccttctggc ggggcgacta ccagggcctg ctcggcttct cctacaccag caccatcatt 6600 

cccgaattca ctgtcgcgaa catccacgcg tccggcggcg ccggacccat catcgttccg 6660 

tcgatccaat ttccggcaat tcccttggac ctcagcgcaa ccggccacat cggcggcttc 6720 

accatcccgc cggtgtccat ttccccgatc acggttcgca tcgacccagt cttcgacctc 6780 

ggccccatca ccgtccagga catcacgatt cccgccctgg gactcgaccc cgcaaccggt 6840

gtcaccgtgg gcccgatatt cagctcaggc tccatcatcg atccattcag ccttacgctg 6900

ctggggttca tcaacgttaa tgtccccgcc atccaaacgg cgcccagcga gattctgcca 6960

ttcaccgtgc tgctgagttc gcttggcgtg acccatctaa caccggaaat caccatcccg 7020

ggattccaca tacccgtcga tccaatccat gtcgagctgc ccctgtccgt caccatcgga 7080

cccttcgtga gcccggaaat caccattccc caactcccgc tgggcctcgc gttgtccggc 7140

gccacccccg ccttcgcctt ccctctggag atcaccatcg accgaatccc agtggttctc 7200

gacgtcaacg cgctgctcgg ccccatcaac gccgggttgg tcatcccgcc cgtcccggga 7260 

ttcggcaaca ccaccgcggt cccgtcgtcg gggttcttca acatcggcgg tggcggcggg 7320

ttgtcgggct tccacaacct cggcgcgggc atgtcgggcg tgctcaacgc gatctcggat 7380

ccgctgctcg gatcggcgtc gggcttcgcc aacttcggca cccaactctc cggcatcctc 7440 

aaccgcggcg cggacatctc gggtgtgtac aacaccggcg cgctgggcct gatcacctcg 7500 

gccctggtct ccggcttcgg caacgtcggc cagcaactgg cgggcctgat ttacaccggc 7560 

accgggccct aa 7572 
<212> Type : DNA 
<211> Length : 7572 

SequenceName : SEQ ID 527 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgtcatttg tgattgcggt gccggaagca ttgacgatgg cggcttcgga tctggccaac 60 

attgggtcga cgatcaacgc ggcgaatgcg gcggcggcat tgccgaccac gggggtggtg 120

gcggctgccg cggatgaggt ttcggcggca gttgcggcct tgttcgggtc gtacgcgcag 180 

agctatcagg cttttggtgc gcagctgtcg gcgtttcacg cccagttcgt gcagtccctt 240 

acgaacggcg cgcgctcata cgtagttgcc gaggccacca gtgctgcgcc gttgcaggat 300

ttgttgggcg tggtaaatgc ccccgcccag gcgttgttgg ggcgcccgtt gatcggcaat 360

ggcgctaacg gggccgacgg gacgggggct cccggtgggc cgggcgggct gttgcttggc 420

aacggcggga atggcggatc gggtgcgccg ggtcagccag gtggtgctgg cggggatgcg 480

gggttgatcg gtaacggcgg gactggcggt aaaggtgggg acgggctggt cgggtccggt 540 

gctgccgggg gtgtcggtgg tcgcggtgga tggttgctgg gtaatggcgg gaccggtggg 600 

gctggtgggg ctgcaggggc cactttggtc ggcggtactg gcggtgtcgg tggggcgacg 660 

gggttgatcg gcagcggggg cttcggcggt gctggcgggg ccgcggcggg ggtgggcacc 720 

accggcggcg tgggcgggag cggtggcgtt ggcggcgtgt tcggcaatgg tggattcggc 780 

ggggccggtg gccttggcgc cgccggcggc gtcggagggg cggccagcta cttcgggacc 840 

gggggcggtg gcggcgttgg tggggacggt gcgcccggtg gtgacggcgg tgcgggtccg 900

ctattgatcg gcaatggcgg tgttgggggt ctgggtgggg ccggggcggc cggtggtaat 960 

ggcggtgccg gcgggatgtt gttgggcgat ggcggtgccg gcggacaggg tgggccggcc 1020 

gtggcgggtg tcctgggcgg gatgcccggc gcgggcggca acggcggtaa tgccaactgg 1080 

ttcgggtccg gtggtgccgg cgggcagggt ggcaccggtc tggccgggac aaacggggtc 1140 
aaccccggct cgattgcgaa ccccaacacc ggtgcgaacg gtaccgacaa cagcggcaac 1200

gggaatcaaa ctggcgggaa cgggggtccc ggccccgccg gtggcgtcgg cgaggctggc 1260

ggcgtcggcg ggcagggcgg gctgggggag tcgctcgacg gcaacgacgg caccggcggt 1320

aagggtggag ccgggggtac tgccggtacc gatggcggtg ccggcggcgc tggcggcgct 1380

ggcggcatag gtgagaccga cggcagcgcc ggcggcgtgg ctaccggggg tgaggggggt 1440

gacggtgcca ccggaggggt cgacggtggc gtgggtggtg ctggcggcaa gggggggcag 1500

gggcacaaca cgggtgtagg tgacgctttc ggcggtgacg gcggaatcgg cggtgacggt 1560 

aacggggcac taggcgcggc gggcggtaac ggcggcaccg gtggtgccgg tggaaacggt 1620

ggacgtggcg ggatgttaat cggcaacggc ggcgccggtg gggccggcgg gacgggcggc 1680

accggtggtg gtggcgccgc cggcttcgcg ggcggtgtcg gcggcgcggg cggagagggt 1740

ctcaccgacg gtgcgggtac cgcggaaggc ggcaccggcg gtctgggggg cctcggcggt 1800 

gtcggcggta ccggcggtat gggtggcagc ggcggtgtcg gcggcaacgg cggggcggct 1860

gggtcgctca tcgggcttgg tggtggcggg ggtgccggcg gtgtcggcgg caccggtggc 1920 

atcggcggca tcggtggtgc cggcggcaac ggtggcgccg gcggcgcggg taccaccacc 1980

ggcgggggag cgacaattgg cggtggcggc ggtacaggcg gcgtgggggg cgctggtggc 2040

actggcggta ccggcggcgc cggcgggacc accggcggca gcggcggagc cggcgggctg 2100 

atcgggtggg caggagctgc cggcggcacc ggcgcaggcg gcacgggtgg gcaaggtggc 2160 

ctcggcggcc agggcggcaa cggcggcaac ggcgggaccg gcgcgaccgg cggtcagggc 2220

ggcgatttcg cgctgggcgg caacgggggc gccggcggcg cgggtgggtc accgggtggc 2280

agctccggca tccagggcaa tatgggcccg cccggcaccc agggcgccga cggatag 2337

<212> Type : DNA 
<211> Length : 2337 

SequenceName : SEQ ID 528 
SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 

<400> PreSequenceString : 

ccccaaggcg ccgacggcaa cgccggcaac ggcggtgacg gcggggtcgg cggcaacggc 60 

ggaaacggcg cagacaacac caccaccgcc gccgccggca ccacaggcgg ggccggcggg 120

gccggcgggg ccggcggaac cggcggaacc ggcggagccg ccggcaccgg caccggcggc 180

caacaaggca acggcggcaa cggcggcaac ggcggcaccg gcggcaaagg cggcaccggc 240 

ggcgacggtg cactcgcagg cagcagcggt ggtgccggcg gtaaaggcgg caacggcggc 300

gacgccggca aggccggtac cggctccgct cctggcacgg cggggaccgg cggcgatggg 360

ggtaagggcg gcaacggcgg cattggcgct gccggcacaa ccggccccgt aggcaccggc 420 

gcgtccggcg gcaccggtgg tagtggtggc gccggcggaa ccggcggtga cggcggcgcc 480 

gccaacggcg gcaccgccgg ggctggcggg gcgggcggca atggcggcaa aggcggcgac 540 

ggtggagcag gcgtcaccag cagcaccgcc ggcaacagcg gcggcgcggg cggcagcggc 600

ggaaagggcg gagacgcggg cgcgggcggc gccggtgcca ctccgggcgc caacggtatc 660 

gctggcaatg gcggcgacgg cggagatggc gcggctggtg ccgtcggcat ctccggcgca 720 

accggcgctg gcgacggcgg gcatggcgga accggcgcgg ccggcggcaa cggtggaacc 780

ggcggtgctg gcggtagcgg catcgacggc gtcggcggcg ggaccggagg taccggcggc 840 

aacggcggca acggcgccat cggcggcgct ggcggagacg ccggtggtag cggaaatagc 900

ggcggaaacg gtgggattgg cggaaagggc ggaaacgccg gtgccggtgg tgccgcgggc 960

agcaacggcg gtaccgtcgg cgccaacggt accggcggcg acggcggcaa cggcggcgct 1020 

gccggggccg ccacggctgg cagcaacggt ggggccggca ccggctcggc cggcggcaac 1080

ggcggcaccg gcggcagagg cggcagtggt ggcgccggcg gcgacggtat cggtggcgtc 1140

ggcggcggca agggcggcaa cggcgcggac ggcgaagtcg gcggtgcggg cggcgccggc 1200

ggcagcgggc ccaacaccag tcccggcggc aacggcgggc aaggaggtca aggcggcagc 1260

ggtggtgccg gtggggcggc cggggctggc ggcgccggtg gcggcgctaa cggcaccgct 1320

ggcaacggcg gccaaggcgg tgccggcggc accggcggcg ccggcgcagc ctcctcagct 1380

accaacggcg gcagcggcgg cgccggcggc accggaggcg acggcggcag cggcggcgcc 1440 

ggcggcaccg gaggcgccgg cggcaccggc ggggcggccg gcgacggcgg acaaggtggc 1500

cagggcggcg ccggcggcgg tgccggtggt caaggtggtg ccggcggtgc cggcgggacc 1560 

ggcggcaacg gcggcaatat caccggcggc accgcgggca ccgcgggggc cgccggtaac 1620 

ggcggcgccg ccggaaaggg tggcgccggc ggccaaggcg gcaccggtgg cgggaccggg 1680 

ggtcagggtg gcgccggcgg cgacggcggt gccggcggca ccggcggcga ccgcaccgtc 1740 

ggcggtggca cggtccccgc cggctccggt ggacaaggcg gtaacgctgg cggtggtggg 1800

gccggcgggc agggtggagc cgacggcggc agcggcggcg acggcggcga cgccggcaca 1860

ggtggcaatg gcggtaacgg cggcaaccgt aattccggca atggcaccgg cggcgctggc 1920 

ggcaacggtg gtggtggtgc taacggtggc gccggcggcg ctgggggcag cggcggcggc 1980 

accggcggca acggcggcgc tggcggcgac gccggcgacg ccggcaacgg cggcaacggc 2040 

aacggcaccg gcaacggcgg caacggcggc aacggcggca tcgccggcat gggcggcaac 2100

ggcggtgccg ggacgggcag cggcaacggc ggcaacggcg gcagcggcgg caacggcggc 2160 

aacgccggca tgggcggcaa cagcggcacc ggcagcggcg acggcggtgc cggcgggaac 2220 
ggcggcgcgg cgggcacggg cggcaccggc ggcgacggcg gcctcaccgg tactggcggc 2280
accggcggca gcggtggcac cggcggtgac ggcggtaacg gcggcaacgg agcagataac 2340
accgcaaaca tgactgcgca ggcgggcggt gacggtggca acggcggcga cggtggcttc 2400
ggcggcgggg ccggggccgg cggcggtggc ttgaccgctg gcgccaacgg caccggcggg 2460
caaggcggcg ccggcggcga tggcggcaac ggggccatcg gcggccacgg cccactcact 2520
gacgaccccg gcggcaacgg gggcaccggc ggcaacggcg gcaccggcgg caccggcggc 2580
gcgggcatcg gcagccttgg cggcggcact ggcggcgatg gcggcaacgg cggcaacggc 2640
ggtaccggcg gcgagggcgg cgaggtcggc ggcgccggcg gcaccggcgg tgcggccggc 2700
aatggcggcg atggcggcac cggcggcacc ggcggcgggg acgggggcgc cggcggcacc 2760
ggcggcaccg gcggcaccgg cggcctcggc gacccccggg tcggcggatc cggcggcgac 2820
ggcggcaccg gcggcagcgg cggtgcggcc ggcaatggcg gcaacggcgg caacgccggc 2880
gcgggaggca atggcaacgg cggcaccggt ggggccggcg gtatcggcgg caccggcggc 2940
aatggcggcg acgccgagcc cggagtgccc ccgggagccg gtggtgctgg cggcgccggc 3000
accaccggcg gcaagggtgg caccggcggc aacggcagtg gcaccggctc gggcggcacc 3060
ggcggcgatg gcggcaccgg cggtggtggt gggaacggcg gcaccggctg gaatggcggc 3120
aagggagaca ccggcagcgg cggtggcgcc ggagacggtg gtaaggcacc agccggtggc 3180
accggcggcg ccggcggcga cggcggagcg ggcggcaagg gcggcagcgg cggcgtctag 3240



<212> Type : DNA 
<211> Length : 3240 

SequenceName : SEQ ID 529 
SequenceDescription : 






Sequence 

<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atggtcatgt cgctgatggt ggcgccggag ctggtggcgg cggccgcggc ggacttgacc 60
gggattgggc aggccatcag cgcggcgaat gcggcggcag cgggcccgac gacgcaggtg 120
ttggcggccg ccggtgatga ggtgtcggcg gcgatcgcgg cgttgtttgg tacccacgcg 180
caggagtacc aggcgttgag cgcccgggtg gcgacgtttc atgagcagtt tgtgcgctcg 240
ctgaccgcgg cfcggcagcgc gtatgcgact gccgaggcgg cgaatgcatc accgctgcag 300
gcgctggagc agcaagtgtt gggtgcgatc aacgcgccca cacagctgtg gttggggcgc 360
ccgctgatcg gtgatggcgt tcacggggcg ccggggaccg ggcagcccgg tggggccggg 420
gggttgttgt ggggtaatgg cggtaacggc ggttcggggg cggccggtca agtcggtggg 480
cccggcggcg cggccgggtt gttcggcaac ggcgggtccg gcgggtccgg cggggccggc 540
gctgccggcg gtgtcggcgg atccggcggg tggttgaacg gcaacggcgg ggccggcggg 600
gccggcggga ccggcgctaa cggtggtgcc ggcggcaacg cctggttgtt cggggccggc 660
gggtccggcg gcgccggcac caatggtggc gtcggcgggt ccggcggatt tgtctacggc 720
aacggcggcg ccggcgggat cggcggcatc ggaggtatag gcggcaacgg tggcgacgcc 780
gggctgttcg ggaacggcgg cgccgggggg gccggggccg cgggcctgcc gggtgccgcc 840
ggcctcaacg gcggcgacgg cagcgacggc ggcaacggcg gaaccggcgg caacggcggg 900
cgcggcgggt tattggttgg caacggcggg gccggcgggg ccggcggcgt cggcggcgac 960
ggtggtaagg gcggcgctgg cgatccgagt ttcgccgtca acaacggtgc cggcggtaac 1020
ggcggtcacg gcggcaaccc cggcgtgggc ggggccggtg gggccggcgg cctgctggcg 1080
ggtgcgcacg gtgccgccgg cgccaccccc accagcggcg gcaacggcgg cgatggcggc 1140
atcggcgcca ccgccaactc acccctacaa gccggcgggg ccggcggtaa tggcggtcat 1200
ggcgggttgg tcggcaacgg cggcaccggc ggcgccggcg gtgccggtca tgcgggttcc 1260
accggcgcta ccggtaccgc cttacaaccg acgggcggta acggcaccaa tggcggcgcc 1320
gggggccacg gcggtaatgg cggaaatggc ggcgcccagc acggcgacgg cggcgtcggc 1380
ggcaagggcg gtgccggcgg tagcggcggc gccggcggaa acggattcga cgccgccacc 1440
ttgggttcgc ccggtgccga tggcggtatg ggcggcaacg gcggcaaggg cggtgacggc 1500
ggcaaggccg gtgatggcgg agccggtgcc gccggtgatg tgaccttggc cgtcaaccag 1560
ggtgccggcg gtgacggcgg caacggcggt gaagtgggcg ttggcggcaa gggtggggcc 1620
ggcggtgtta gcgcgaaccc ggccctgaac ggttcggccg gggcgaacgg caccgcgccc 1680
accagcggcg gcaacggtgg caacggaggt gccggcgcca cccccaccgt cgcgggagaa 1740
aacggcggcg ccggtggtaa cggcggccat ggcgggtcgg tcggtaacgg cggtgcgggt 1800
ggtgccggcg gaaatggcgt cgccggcacc ggccttgccc tcaacggcgg caacggcggc 1860
aacggcggca tcggcggcaa cggcggatcg gcggccggca cgggcgggga cggcggcaag 1920
ggcggcaacg ggggcgccgg agccaacggc caagacttct ccgcgtccgc caatggcgcg 1980
aatggcggac agggcggcaa cggcggcaac ggcggcatcg gcggcaaggg tggtgacgcc 2040
ttcgccacgt tcgctaaggc cggcaacggc ggtgccggcg gcaacggcgg caatgtgggc 2100
gttgccggcc agggtggggc cggcggcaag ggcgccattc cagccatgaa gggtgcgacc 2160
ggcgccgatg gcaccgcacc caccagcggc ggtgacggcg gcaacggcgg caacggcgcc 2220
agccccaccg tcgcgggcgg caacggcggt gacggcggca agggcggcag cggcgggaat 2280
gtcggcaatg gcggcaatgg cggggccggc ggcaacggcg cggccggcca agccggtacg 2340
ccgggcccta ccagcggtga ttccggcacc tcgggcaccg acggtggggc tggcggcaac 2400
ggcggggcgg gcggcgccgg cggaacactg gccggccacg gcggcaacgg tggtaagggt 2460
ggtaacggcg gccagggtgg catcggcggc gccggcgaga gaggcgccga cggcgccggc 2520
cccaatgcta acggcgcaaa cggcgagaac ggcggtagcg gtggtaacgg tggcgacggc 2580
ggcgccggcg gcaatggcgg cgcgggcggc aaggcgcagg cggccgggta caccgacggc 2640
gccacgggca ccggcggcga cggcggcaac ggcggcgatg gcggcaaagc cggtgacggc 2700
ggggccggcg aaaacggcct aaacagcggg gccatgctgc cgggcggcgg caccgtagga 2760
aaccccggta ccggcggcaa cggcggcaac ggcggcaacg ccggcgtcgg cggcaccgga 2820
ggcaaggccg gcaccggctc cfctgacgggc ttggacggca ccgacggcat cacccccaac 2880
ggcggtaacg gcggcaatgg cggcaacggc ggcaagggcg gcaccgccgg caacgggagc 2940
ggcgcggccg gcggcaacgg cggcaacggc ggctccggcc tcaacggcgg tgacgccggc 3000
aacggcggca acggcggtgg ggcgctgaac caggccggct tcttcggcac gggcggcaaa 3060
ggcggtaacg gcggcaatgg cggtgccggc atgatcaacg gcggcctcgg cggcttcggc 3120
ggcgccggcg gtggcggcgc cgttgacgtc gccgcgacaa cgggcggcgc tggcggcaat 3180
ggcggtgccg gcggcttcgc tagcaccggg ttgggtggcc caggcggcgc cggcggtccc 3240
ggcggcgcgg gcgactttgc tagcggtgtt ggcggtgtcg gcggcgccgg cggggacggc 3300
ggtgccggcg gggtcggcgg cttcggcggc cagggcggca tcggcgggga agggcgcaca 3360
ggcggcaacg gcggtagcgg cggcgacggc ggtggcggca tttccttagg cggcaacggc 3420
ggcctcggcg gcaacggcgg cgtctccgag actgggtttg gcggcgccgg cggcaacggc 3480
ggctacggcg gtccgggagg ccccgaaggc aatggcggcc tcggcggcaa cggcggcgcc 3540
ggcggcaacg gcggcgtcag caccacgggc ggcgacggcg gcgccggcgg caagggcggc 3600
aacggcggcg acggcgggaa cgtcggtttg ggcggtgacg ccggctccgg cggcgcgggc 3660
ggcaatggcg gtatcggcac cgacgcgggc ggtgccggag gggccggtgg cgctggcggt 3720
aacggcggta gcagcaaaag cacgaccacc ggcaacgccg gctccggtgg tgccggcggt 3780
aatgggggca ctggcctcaa cggcgcgggc ggtgctggcg gggccggcgg caacgcgggt 3840
gtcgccggcg tgtccttcgg caacgctgtg ggcggcgacg gcggcaacgg cggcaacggc 3900
ggccacggcg gcgacggcac gacgggcggc gccggcggca agggcggcaa cggcagcagc 3960
ggtgccgcca gcggctcagg cgtcgtcaac gtcaccgccg gccacggcgg caacggcggc 4020
aatggcggca acggcggcaa cggctccgcg ggcgccggcg gccagggcgg tgccggcggc 4080
agcgccggca acggcggcca cggcggcggt gccaccggcg gcgacggcgg caacggcggc 4140
aacggcggca actccggcaa cagcaccggc gtcgcgggct tggccggtgg tgccgccggc 4200
gccggcggca acggcggcgg cacttccagc gccgccggcc acggcggcag cggcggcagc 4260
ggtgg^agcg gcaccacggg cggcgccggc gcggccggcg gcaacggcgg cgccggtgct 4320
ggcgggggca gcctgagcac aggccagtcc ggcggcccac ggcggcagcg gtggtgccgg 4380
tggcaacggc ggcgctggct cggccggcaa cggcggcgcc ggtggtgccg gtggcaacgg 4440
cggtgccggc ggcaacggtg gcggtggcga tgccggcaac gccggcfccag gcggcaatgg 4500
cggcaagggc ggcgacggtg tcggccctgg ctccaccggc ggcgcgggcg gcaagggcgg 4560
cgctggcgcc aacggcggtt ccagcaacgg caacgctcgc ggtggcaacg ccggtaa 4617

<212> Type : DNA 
<211> Length : 4617 

SequenceName : SEQ ID 530 
SequenceDescription : 

Sequence 

<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgtccttcg tggtcacagc accgccggtg ctcgcgtcgg cggcgtcgga tctgggcggt 60
atcgcgtcca tgatcagcga ggccaacgcg atggcagcgg tccgaacgac ggcgttggcg 120
cccgccgccg ccgacgaggt ttcggcggcg atcgcggcgc tgttttccag ctacgcgcgg 180
gactatcaaa cgctgagcgt ccaggtgacg gccttccacg tgcagttcgc gcagacattg 240
accaatgcgg ggcagctgta tgcggtcgtc gacgtcggca atggcgtgct gttgaagacc 300
gagcagcagg tgctgggtgt gatcaatgcg cccacccaga cgttggtggg tcgtccgctg 360
atcggcgatg gcacccacgg ggcgccgggg accgggcaga acggtggggc gggcggaatc 420
ttgtggggca acggcggtaa cggcgggtcc ggggctcccg gacagccggg cggccggggc 480
ggtgatgccg gcctgttcgg ccacggcggt catggcggtg tcggggggcc gggcatcgcc 540
ggtgccgctg gcaccgcggg cctgcccggg ggcaacggcg ccaacggcgg aagcggcggc 600
atcggcggcg ccggcggcgc cggcggcaac ggcgggctgc tattcggcaa cggtggtgcc 660
ggcggccagg gtggctccgg cggacttggg ggctccggcg ggacgggcgg cgcgggcatg 720
gctgccggtc ccgccggcgg caccggcggc atcgggggca tcggcggcat cggcggcgcg 780
ggcggggtcg gcggccacgg ctcggcgttg ttcggccacg ggggaatcaa cggcgatggc 840
ggtaccggcg gcatgggtgg ccagggcggt gctggcggca acggctgggc cgctgagggc 900
atcacggtcg gcattggtga gcaaggcggc cagggcggcg acgggggagc cggcggcgcc 960
ggcgggatcg gtggttcggc gggtgggatc ggcggcagcc agggtgcggg tgggcacggc 1020
ggcgacggcg gccagggcgg cgccggcggt agtggcggcg ttggcggcgg cggcgcaggc 1080
gccggcggcg acggcggcgc gggcggcatc ggcggcactg gcggtaacgg cagcatcggc 1140
ggggccgccg gcaatggcgg taacggcggc cgcggcggcg ccggtggcat ggccaccgcg 1200
ggaagtgatg gcggcaatgg cggcggcggc ggcaacggcg gcgtcggtgt tggcagcgcc 1260

ggaggggccg gcggcaccgg cggtgacggc ggggcggccg gggcgggcgg cgcgccgggc 1320

cacggctact tccaacagcc cgcgccccaa gggctgccca tcggaaccgg cgggaccggc 1380

ggcgaaggcg gtgccggcgg cgccggtgga gacggcgggc agggcgacat cggcttcgat 1440 

ggcggccggg gtggcgacgg cggcccgggc ggtggcggcg gcgccggcgg tgacggcagc 1500

ggcaccttca atgcccaagc caacaacggc ggcgacggtg gtgccggcgg tgttggggga 1560 

gccggcggca ccggcggcac gggtggggtc ggggccgacg ggggtcgcgg gggggactcg 1620 

ggccgcggcg gcgacggcgg caacgccggc cacggcggcg ccgcccaatt ctccggtcgc 1680 

ggcgcctacg gcggtgaagg tggcagcggc ggcgccggcg gcaacgccgg tggcgccggc 1740 

accggtggca ccgcgggctc cggcggtgcc ggaggtttcg gcggcaacgg tgccgatggc 1800

ggcaatggcg gcaacggtgg caacggcggc ttcggcggaa ttaacggcac gttcggcacc 1860 

aacggtgccg gcggcaccgg cgggctcggc accctgctcg gcggccacaa cggcaacatc 1920 

ggcctcaacg gggccaccgg cggcatcggc agcaccacgt tgaccaacgc gaccgtaccg 1980 

ctgcagctgg tgaataccac cgagccggtg gtattcatct ccttaaacgg cggccaaatg 2040

gtgcccgtgc tgctcgacac cggatccacc ggtctggtca tggacagcca attcctgacg 2100

cagaacttcg gccccgtcat cgggacgggc accgccggtt acgccggcgg gctgacctac 2160 

aactacaaca cctactcaac gacggtggat ttcggcaatg gccttctcac cctgccgacc 2220 

agcgttaacg tcgtcacctc gtcatcaccg ggaaccctgg gcaacttctt gtcgagatcc 2280 

ggtgcggtgg gcgtcttggg aatcgggccc aacaacgggt tcccgggcac cagctccatc 2340 

gttaccgcga tgcccggcct gctcaacaac ggtgtgctca tcgacgaatc ggcgggcatc 2400

ctgcagttcg gtcccaacac attaaccggc ggtatcacga tttctggagc accgatttcc 2460 

accgtggctg ttcagatcga caacgggccg ctgcaacaag ctccggtgat gttcgactcc 2520 

ggcggcatca acggaaccat cccgtcagcc ctcgccagcc tgccgtccgg gggattcgtg 2580 

ccggcgggaa cgaccatttc ggtctacacc agcgacggcc agacgctgtt gtactcctac 2640 

accaccaccg cgacaaacac cccatttgtc acctccggcg gcgtgatgaa caccgggcac 2700

gtccccttcg cgcagcaacc gatatacgtc tcctacagcc ccaccgccat cgggacgacc 2760 

acctttaact ga 2772 
<212> Type : DNA
<211> Length : 2772 

SequenceName : SEQ ID 531

SequenceDescription :

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

ttgatcggca acggcggggc cggcgggtcc ggggcgcccg gcgccatcgg tggggccggc 60 

gggcccgcgg ggttgatcgg tgtcggaggt gccggcgggg ccggtggaga ctccgcggtc 120 

gcgggtgtca tcggaggggc cggtggggca ggcggggctg ccctgctgtt cggtgccggt 180 

ggggccggcg gggccggggg ttccggcggt tccggcgcag ctggtggggc cggtggcgcc 240

ggtggggccg gcgggctgtt cgccagcggc ggcagcggcg ggttcggcgg gttcgcatcg 300

acgggcaccg gtggggccgg cggcaccggt ggggctggtg ggttgttcgc cagcggcggg 360

gtcggcggta ctggcggggg agccgggtcc ggcggtaccg gtggggttgg tgggacgggt 420 

ggggccggag ggctgttcgc tagcggcggc gctggcgggg ccggcgggtc cggcggtacc 480

ggtggggctg gtgggacggg tggggccggc gggctgttcg gagccggtgg cgctggcggg 540

ctcggcgggc aaggcaacca caccggcggg cacggtgggg ccggtggcag cgccggcctg 600 

ctcgcccttg gcgacggcgg cgctggcggg gccggcgggg ccgctaccac cggaaccggc 660 

ggggccggcg gggcgggtgg caaggccggc ctgctgttcg gctccggtgg ggccggtggg 720 

tccggtgggg ctgccggcac cttcggtgac accggtaact ccggcggggc cggtggggcg 780 

ggtggcaagg ccggcctgct gttcggctcc ggtggggccg gtgggtccgg cggcgctggg 840

ggcttcgcca acggctctac cggcggtgcc ggcggggccg gcggcggggc cgggctgatc 900 

ggcaacggcg gcaacggtgg cagcggcggc acgtcggttg ccaccggggg ggccgggaac 960 

ggcggtgccg gcggcgccgg cggcggggcc gggctgatcg gcaacggcgg caacggcggc 1020 

agtggcggaa tgggcgatgc cccgggcggc accggcgtcg gcggcatcgg tgggctgttg 1080

ttgggtttgg acggcgccaa cgccccggcc agcaccaacc cgctgcacac cgcgcagcag 1140

caggcgttgg ccgcagtcaa cgcgcccatc caggccgtga ccgggcgccc gctgatcggc 1200

aacggcgcca acggcgcccc gggcagcggg gcccccggcg ggcacggcgg gtggttgttc 1260 

ggcggcggag ggaccggcgg gtccggcgtc agcggcgggg cgggcggaga tggcggggcc 1320

ggcgggatct tgttcggcgc cggcggggcc ggcggcgcgg gcggggccgt cacgggaacc 1380 

ggcgccaccg gcgggtccgg tggggccggc ggtggagcct tgctgtttgg ggccggtggg 1440

gccggtggag ccggcgggtc cagcgggatt ggcgggttcg ccgcgggcgg ggccggtggg 1500 

cccggagggg ccggtgggct gttcaacggc ggcggggccg gcggggccgg cgggtccggc 1560 

gtcagcggcg gggctggcgg ggagggcggg gccggcgggg ccggtggcct gttcgccggt 1620 

ggcggggccg gcggggccgg cggatcgggc aacaacgtcg ggggggccgg cggggccggt 1680 

ggggtcggtg ggctgttcgg ggccggcggg gccggcggat ccggcggcgg cggtagcgtt 1740

gctggcgaca gtggggccgg cggcaacgcg ggcttgctcg cccccggtct cgccggcggt 1800 

gccggcggtg gcggcgggca gggttttgac accggcgggg ccggcgggcc cggcggcgac 1860 
gccggcctgc tggtcggctc cggcggggtc ggaggtgccg gcggattcgg cctcactacg 1920

ggtgggcctg gggcggccgg cggcgacgcc ggcctgctgt tcggctccgg cggcgctggc 1980

ggggccggcg gctccggccg aaccgacctc ggcggcgctg gcggagccgg cggcaaggcc 2040

gggctgatcg gcaacggcgg taacggcggg gccggcgggg ccggcgggaa cggcggcggg 2100 

gacggcgggc ccggtggagc cgccttcggg ctcggtaacg gcggcaacgg cggcaacggg 2160

gggaccggca cgtccgcggg cagccccggt gccggcggcg ccggtggttc gctgatcggc 2220 

gcggaggggc tgcccgggct gctgccctag 2250 
<212> Type : DNA 
<211> Length : 2250 
SequenceName : SEQ ID 532

SequenceDescription :

Sequence 

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

atgtcgtttg tgattgctgc gccggaggcg ttggtcgcgg tcgcttcgga tctggcgggc 60 

attgggtcgg cgctggcgga ggccaacgcc gcggcgttgg ccccgacgac ggcgttgttg 120 

gccgcgggtg ccgatgaggt gtcggcggcg atcgcggcgc tgtttggcgc gcacgggcag 180

gcgtatcaga cggttagcgc ccaggcgtcg gcgtttcatg cccagtttgt gcaggcgttg 240

actggcggcg gcggggcgta tgcggctgcc gaggccgcca acgtctcggc ggcgcagagc 300 

accgaccagc ggctgctcga tctgatcaat gggcccaccc aggcgttgtt ggggcgtcca 360 

ctgatcggtg atggcgccaa cggcgggccg gggcaagacg gcgggcccgg ggggttgctg 420

tacggcaacg gcggcaacgg cggcactagt accaccgccg gggtggccgg cggcaacggt 480 

ggcgccgccg ggctgatcgg caacggcggg gccgggggcg gcggcggggc cggcgcggcc 540

ggcggcaatg gcggtgcggg cgggtggctg tatggcaacg gcggcgccgg cggggccggt 600 

gggacatcgg tgatacccgg tgtcgccggc ggcaatggcg gggctggcgg gtccgcggga 660 

ctgtggggta ccggcggggc cggtggcgac ggcggcaacg gccggtcggg gccagtcaac 720 

gtcgccggca gcgcgggcgg caacggtggc gctggtggcg ccgccgggtt attcggtgac 780 

gccggggccg gtggcaacgg cggcaagggc ggtgctggcg gcgccgcctt tagcattaac 840

ttcaccgcag gcgatggcgg tgcgggaggt gccggtgggt ccggcggcca cgcattgctg 900 

tggggcgccg gcggagccgg gggtaacggc ggatccggcg gcacgggggg tgccggcggc 960 

agcaccgctg gcgctggcgg caacggcggg gccgggggtg gcggcggaac cggtgggttg 1020

ctcttcggca acggcggtgc cggcgggcac ggcgccgccg ccggaaacgg cttagccgcg 1080

ggtaafcggcg tcagcagcag cggcggcggc ggtgccggtg ggaccggcgg ggccggtggg 1140

gacggtggcg ccggcggggc cggaggcaac gccaggctgt ggggcgtcgg tggcgccggc 1200 

ggggccggcg gggacggtgg cgccggcggg gccggcggca aaggcggctc tggcctcagc 1260

ggtaacgcca acggcggggc cggcggcgac agcggccgtg gcggcacggg cggcgccggc 1320

ggcgagggcg gcgccgccgg gctgctggtg ggcaccggcg ggcacggcgg tgacggcggg 1380 

gccggcggcg ccgccgtcaa gggcggtgac ggcggggccg ccgccggcac gggcatcgcc 1440

ggcgctggcg gccgtggcgg cgcgggcggc agcggtggca gcggtggtga cggcgggggc 1500 

ggggccgccg gccccgccgg gtggctgttc ggcgatggcg gggctggcgg gaacggcggg 1560 

gccgcggccg ccggcggcgc cggcggccaa gccggcggtg gcggcgggaa cggcggcaat 1620 

ggcggcaacg gcggcaatgg cggcaatggc ggcaacggcg ccaccggggg gtggctgtac 1680 

ggcaacggcg gggccggcgg ccagggcgcc accgccggag ccggcggagc cggcgctaac 1740

ggcgtcagca gcaccaatgg cggcggcacc ggcggcaacg gggggatcgg cgggaccggt 1800 

gggtccggcg gggccggtgg caacgccggg ctgttgggcg tgggcggcgc cggcgggcac 1860

ggcgcctccg gcggcgccgg cgataggggc ggcgctggcg gtaccgggtt cataagcagt 1920 

gacggcggtg ctggcggtga tggcggtgat ggcggcaacg gcggggccgg cggcaccggt 1980 

gggctgttgt tcggtgccgg cggcaatggt ggccccggcg ggtctggcgg tgccgccgat 2040

attggcggca acggcggcgc cggtaacggc gggggcaccg acgggaacgg cggtaatggc 2100 

gggtccggcg gcggcgccgg cagcggcggt gacggcggcg gggctggcgg caacggtgcg 2160 

tggctgttcg gcaatggcgg cgccggcggg ggcggcggaa aaggcggcaa cggtgccggc 2220 

ggcgggcttg gcggcggttc attcggcctc cccggcctga acggcagcgg cggcgacggt 2280 

ggcgacggcg gtaacggtgc ccccggcggg gtgctgtatg gcaatggcgg cgccggcggc 2340

caggggtcaa gcggtggcat cggcggcccc ggcgccaccg gcggtgccgg cggcaaaggc 2400 

ggtgatggtg gcgatgcgca gctgatcggc gacggcggca atgggggcaa cggaggcgcg 2460 

ggcggcaccg ggggcacccc ggggcccggc ggacccggcg ggtccggcgg gcttggaggc 2520 

ctgctgttcg gccaaaccgg cacggctggc gtgtcgccgt ag 2562 

<212> Type : DNA

<211> Length : 2562 

SequenceName : SEQ ID 533 
SequenceDescription : 

Sequence



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString :

atgtcctatc tcgtcgtggt gccggagttg gtcgcagcgg cggcaacaga tttggcgaac 60 

atcggttcgt cgattagtgc agccaacgcg gccgcggcgg caccgaccac ggcactggtc 120 

gcagccggcg gcgacgaggt atcggcggcc atagccgcgt tgttcggagc gcatgctcgg 180 

gcatatcaag cgttgagtgc ccaggcggcg atgtttcatg aacagtttgt ccgggccctc 240

gccgccggcg gtaactccta cgccgtcgct gaggcggcaa ccgcgcaatc ggttcagcaa 300 

gatctgctca acctgatcaa tgcgcccacc caggcgctgt tggggcgtcc gctgatcggc 360 

aacggcgcca acgggctgcc gggtacgggc cagaacggcg gcgacggcgg gattctgtac 420

ggcaacggcg gcaacggtgg gtccggcggg gtcaaccagg ccggtggcaa tggcgggaat 480

gctgggctgt ggggcaatgg cggatccggc ggagccggcg ggaacgccac cactgccggc 540

cgcaacggct tcaacggggg cgccggggga agcggcggtt tgctgtgggg caatggcggt 600

gccggcgggg ccggtgggaa cggcggtccg gctccgctcg tgggcggggt gggcaccacc 660 

ggtggcgccg gcgggaacgg cggcggcgcc gggttgttct acggtttcgg cggcgccggt 720

gggaaqggcg ggatgggcgg ggtggcaccg agcaccggcc cctcgatggg catcctcccg 780

gccggcggtg tcggcgggcc tggtggctcc ggcggggcga gcgcgcttgc cttcggctcc 840

ggcggcgtcg gcggtgccgg tggcttgggc gggccgaccg atggcaccgt ccagggggtg 900

ggcggcttcg gcggtcaggg cggcaacggc gggcagagcg gcttgttgtt tggcaacgcg 960 

ggagccggcg gggcaggcgc tgccggcgga gccggcaccg gcgacaccga gagcttcggc 1020

ggccacggcg gggccggcgg tgatggcggc gctgttggct tgatcggtaa cggcggggcc 1080 

ggcggcaccg gatctcccgg cgctgtggtg ggtggtaacg gcggcgtcgg tggtctgggt 1140

ggcgccggca gtcccggggg tctgttgtac ggcaccgggg gggccggcgg caatggcgga 1200

ccgggtggtg acggtggtac tggcgcgacg gtgggctttg ccggctccgg cggtttcggc 1260 

ggtgcggggg gcatcgccca gctgtttggc acgggtggca tgggtggtag cggcggtggt 1320

ataggcgctg gcaccacgac cgtggtgccg cccgacgtcg ccccggtggg tggcacaggc 1380

ggcaatggcg gtcgcgccgg gctgctgttg ggtgtgggtg gcatgggcgg taatggcggt 1440

gccaccagcg tcggcgggac gctctacgcc gccggtggaa acggcggcga cggcgggttg 1500

gtgtggggca acggtggcac cggcgggagc ggtggcgccg gcggggcggg cagcgtcggc 1560 

aacggcggtg cgggtggcaa cgcggcactg ctgttcggca acggcggggc gggcggggcc 1620 

ggcggcgccg gcggcatcgg tgccggcgga gccggcggct tcggcgcggt tctgtttggc 1680 

aacggcgggg ctggcgggag cggtgccccc ggtggcatcg gcgccggtgg caatggcgga 1740

aacgcgctgc tggtcggcaa cggcggcaac ggtggggcag gtaccggtgg ggctgctggc 1800

ggtgccggtg gctcgggcgg gttgctattc ggccaaaatg ggatgcccgg gccgtga 1857 

<212> Type : DNA
<211> Length : 1857

SequenceName : SEQ ID 534
SequenceDescription :

Sequence 


<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgaattttt ctgtactgcc gccggagatc aattcagcgc tgatattcgc cggggcaggg 60 

ccggaaccga tggcggcggc cgcgacggcc tgggacgggt tggccatgga attggcctcg 120 

gccgcagcct ctttcggctc agtgacatcc ggactcgtgg gcggggcgtg gcagggcgcg 180

tcgtcgtcgg cgatggcggc agcggcagcc ccctatgcgg cgtggcttgc cgcggcggcg 240 

gtccaggccg agcagacggc cgctcaggct gcggcgatga tagccgagtt tgaagcggtc 300

aagacggcgg tggtgcagcc gatgctggtg gcggccaacc gtgccgacct ggtgtcgctg 360

gtgatgtcga acctgtttgg acagaacgct ccggcgatcg ctgccattga agccacgtac 420 

gagcaaatgt gggctgccga tgtgtcggcg atgtctgcct accatgccgg ggcatcggcg 480

atcgcctcgg cgctgtcccc gttcagtaaa ccgctgcaga acctggctgg cttgccggct 540 

tggttggcca gcggcgcgcc tgcggccgcc atgaccgcag ccgcaggcat accggcgctt 600 

gcgggcggac ccaccgccat caacctgggc atagccaacg tcggcggtgg caacgtcggc 660 

aacgccaaca acggccttgc caacatcggc aacgccaacc ttggcaacta caatttcggg 720 

tccggaaatt tcggtaactc caatatcggc tcagcaagcc tgggtaataa caacatcggc 780

ttcgggaacc tcggcagcaa caatgtcggc gtgggaaacc ttggcaatct caacaccggg 840 

tttgccaaca ccggcttggg caacttcggc tttggcaaca ctggcaacaa caacatcggc 900 

atcggtctta ccggcaacaa ccagatcgga atcggcgggc tcaactcggg caccgggaat 960 

ttcggattgt tcaactcggg cagcggaaac gtcggcttct tcaactccgg caatggaaac 1020 

tttggcatcg gaaactcggg taatttcaac accggtggct ggaattctgg acacgggaac 1080

acgggcttct tcaatgcggg ctcgtttaac accggtatgt tggacgtcgg caacgcgaac 1140 

acaggcagcc tgaacaccgg cagttataac atgggcgact tcaatccggg gtcgtccaac 1200 

accggcacgt tcaacacggg aaatgctaac acgggtttcc tcaacgccgg aaatatcaac 1260 

actggtgtct tcaatattgg ccacatgaat aatgggctgt tcaacacggg tgacatgaac 1320 

aatggcgtct tctaccgggg cgtggggcag ggcagcctgc agttcagtat tacgacacct 1380

gatctgactc tgccgccgct gcaaataccg gggatatcgg ttcccgcctt cagtctgccg 1440 

gcaataacgc tgccgtcgct gaacatcccg gccgccacca caccggccaa catcaccgtc 1500 
ggcgccttca cgcctgcccgg gttgacgttg
ccagccaaca tcaccgtggg tgccttcagc 
atcccggccg ccaccacacc agccaacatc 
acgttgccgt cgttgaacat cccggccgcc 
ttcagcctgc ccgggttgac gttgccgtcg 
aacatcaccg tcggcgcctt cagcctgccc 
gccgccacca cacccgccaa catcaccgta 
ccttccgtag ccattccgcc ggtgacggtc 
ccgccattgc agattccgga agtaactatt 
atcggtggct fctagtctacc tgcgatacat 
gtgggccaat ttggcctgcc ctccataggc 
acagtaccgg cttttggaat accctttacc 
cagccgcccg cjcggcgggct tagtactttc 
gacttaccac aattggtggt tcacccatac 
ttctttctgc ccgccttcaa catacccggg 
ttcaccctgc cgcagatcac caccccagct 
atcggcgtgg cjcggcttcac tctgccgcag 
ctaaccatca actcgatcgg cgtcggcggg 
atcaccaccc caccgctgac catcgacccc 
atcaccaccc cacccatcac caccccaccg 
ttcaccctcc cccaaatcac caccccaccc 
atcggcgtgg cjgggcttcac cacgcccccg 
accacgatcg <gggccttcgc gatccccggg 
ccttcgtcgg gcttcttcaa ttccggtgcg 
tcgggcctct cgggttggtt caacaccaac 
cagaacttcg gcgggctatc ctcgggcttt 
gccaacaggg gcatcctgcc gttctcggta 
ggcaccaacc tggcgggttt cttccaaggc 
<212> Type : DNA
<211> Length : 3162

SequenceName : SEQ ID 535
SequenceDescription :



ccgtcgttga 
ctgcccgggt 
accgtcggcg 
accacaccag 
ttgaacatcc 

gggttgacgt 

agcggctttc 
ccgcccatta 
ccgcagctga 
actcaaccga 

tgggatgttt 

ctacaattcc 
accaatggcg 
acattgaccg 
atcgatgtcc 
atcaccaccc 
atcaccaccc 
ttcaccctgc 
atcaacctca 
ctgaccatcg 
atcaccaccc 
ctcaccgttc 
gggccgggct 
ggcggcaact 
ccggccgggc 
tccaaccttg 
gccagcgtcg 
accacgtcct 



acatcccggc 
tgacgttgcc 
ccttcagcct 
ccaacatcac 
cggccgccac 
tgccgtcgtt 
agttgcctcc 
cggtgggtgc 
cgatacccgc 
taacggtcgg 
tcctaagcac 
agaccaatgt 
ccctcatctt 
gccctattgt 
ccgctatcaa 
cggagttcgc 
aggaaatcat 
cgcaaatcac 
ccggcttcac 
accccatcaa 
caccgctcac 
ccggcatcca 
acttcaactc 
cgggcttcgg 
tgttgggcgg 
gcagcggcgt 
tttccggctt 
aa 



cgccaccaca 1560 

gtcgttgaac 1620

gcccgggttg 1680 

cgtcggcgcc 1740 

cacaccagcc 1800

gaacatcccg 1860 

gctgagtatt 1920

ttttaatttg 1980 

gggtatcaca 2040 

ccagattggc 2100 

acctaggata 2160 

gcctgcgctt 2220 

cggtgagttt 2280

catcggttca 2340

cgtcgatggc 2400

gatccctccg 2460

caccccggag 2520

caccccaccc 2580 

cctcccccaa 2640 

cctcaccggc 2700 

catcgagccg 2760 

cctgcccagc 2820 

gagcaccgcg 2880

caacaacggc 2940 

ctcgggctat 3000

ctcaggcttc 3060 

tgccaatatc 3120 

3162 



Sequence 

<213> OrganismName : Mycobacterium tuberculo
<400> PreSequenceString :

atgttgtatg tagttgcgtc acccgacttg atgaccgcgg 

attggttcgg cgatcagcac ggcaaatggt gcggcggcac 
gccgcggccg crcgacgaggt gtccacgcag atcgcggctc 

agctaccaaa ccctcagcac ccaggcagcg gcgtttcata 

accacggccg cggcttccta cgccagcgta gaggccgcca 

gcgctagacg fcgattaatgc gcccgcccag acactgctcg 

ggcgccgacg gatcgacacc ggggcaggcc ggcgggcccg 

ggcggtaatg gcgccgccgg tgggcccaac caggccggcg 

ttgatcggca acggcggggc gggcggcgcc gggggtgttg 

ggcacgggcg gcctgctatt cggcaacggc ggggccggcg 

gcaggtatca acggcggcag cggcgggcag ggaggccacg 

ggccagggcg gtgccggcgg gccaggtggc accggcgcca 

cccaccccca tcggcaccgc agcgcctggc agcgacggcg 

ggtaacacgg acctcaccgg cggcgccggt ggcgacggca 

aacggcggca acggcggtac cggcggcgca gctaggaact 

tcctttggtg gtgccggcgg cgccggaggc gacggcgcca 

ggcggggaag ccctcaccga aggcggtgcc accgccgtta 

ggtaacgccg aggcttccgg cggcgccggc ggcaacggcg 

gccaccacca gcgtgaccgg gggtaacggc ggtaacggtg 

gcgccgggcg gcgctggcgg cagcggtggc gtcggcggtg 

ctggccggca acggcggcac cggcggtgcc ggtggcaacg 

gcccccggcg gtgccggcgg cgccggcggc aaagccgaca 

aatgccaccg taaccggggg caatggcggg acaggcggag 

accggggggg ctgggggtgc cggaggtcta ggtggtcacg 

attggcaacg gcggcgccgg tggcgctggc ggcctcggcg 

gcgggcggtg agggcggtgc cggcggcgcc ggaggcgaag 

accaactccg ccggcggtga cggaggggcg ggcggtactg 

ggtgccggcg gagcccccgg cctcggtggc gcgggcgggg 

cagtcgggca gcaccggcgg cggtggcgcc ggcggtgccg 

ggcgcgggcg gcagcggcgg tgcgggtggc catggcgaca 



sis H37Rv

cggctaccaa 
tcccgactgt 
tattcggagc 
gtcggtttgt 
acgcgtcgcc 
gacgtccgct 
gcgggttgct 
gcgccggcgg 
gcgcggtcgg 
ggcaaggcgg 
gtggcaacgc 
tgggcgtcgc 
taaatcagat 
atgccggcag 
catctggtgg 
acggcggcga 
gtggtgctgg 
gcaaaggtgg 
gcaatggcca 
acggcggccg 
gcggtaccgg 
tcgccaacag 
acggcggcag 

ggggtgcagg 

gtgcgggcgg 
ctattcccgg 
gcggcaatgg 
ccggcggatg 
gtggtgccgg 
ctacctccgg 



tctggcggag 
tgaggtggtg 
gcatgccagg 
gcaggcgttg 
acttcaggtt 
aattggtaac 
gtacggcaac 
caacgccggc 
cggtaaacgc 
gctcggcctc 
catcctgttc 
cggcaccaat 
tgggaacggt 
caccaccgtg 
taccggtaac 

cggtggcgct 

tggtaaggga 
ctttgctcag 
cgacagtaac 
tggcggcctg 
tggcgccggt 
cctcggcgac 
cgcgctgggc 
cgggctgctg 
cgccggcggt 
cggggcgtcc 
cggtgacggc 
gttgatcggc 
aggtgccggt 
caagaacggt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
tcgtcfcggca ccgcgggctt cgacggcaac cccgggcagc ccggctga 1848
<212> Type : DNA 
<211> Length : 1848 

SequenceName : SEQ ID 536 
SequenceDescription :

Sequence 

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgcattact cagtgttgcc gccggagatc aactcggcct tgatcttcgc cggggcgggc 60 

tccggaccga tgctggcggc ggcgtcggcc tgggacgggc tggcaaccga attagcctcg 120

gctgctgtct ctttcggctc ggtgacagcc gggctggtcg gcgggtcgtg gcagggtcgg 180 
tcatcggtgg cgatggcagc ggcggcagcc ccgtatgcgg ggtggctggc cgcggcggcg 240

acccaggccg agcaggcggc cacccaggcc caggtgatgg tggccgagtt cgaggctgtg 300

cggctggcga tggtacaacc ggcgctggtg gccgccaacc gttccggcct catatcgctg 360

gtgatatcga acctttttgg tcaaaacgct cccgcgatcg cggccgccga agccgcatac 420 

gaggacjatgt gggctctgga tgtatcggcg atggcggcct accattccgg ggcgtcggcg 480

gtcgctgtgg cgctaccggc attcgccctc ccgctgcggc ttccggcggg tctggcggcc 540 

gggcccgcgg ccgtggtgac cgcgctcacc acggccgtgg gcatgccgac ttttgccggc 600

cgggcgatcg ccgctagcct eggcttggcc aacgtcggtg gtggcaacct cggcaatgcc 660 

aacaafcgggc tcggcaacat cggcaacgcc aaccttggca acaacaatct ggggtccggc 720

aacttcggta gcttcaatat cggctcggcc aacctaggtg gcaacaacat cggcatagga 780 

aacgcgggcg ccaacaactt cggacttgca aacctgggca atttgaacac gggattcgcc 840 

aatgcaggca tcggcaactt cggaattgcg aacaccggca acaacaatat cggcaacggc 900

ctgacfcggaa acaaccaaat cggcattggc ggactcaatt ccggcaacgg taacgtcgga 960 

ttattcaacg cgggtagcgc caatatcggt ttcttcaact ccggcaatgg caactttggc 1020 

atcggcjaact ccggtaactt cagcactggc ctgttcaacc ccggacacgg caacaccgga 1080

ttcctgaatg cgggctcttt caatacgggc atgttcgacg ttgggaacgc gaacaccggc 1140 

agcttcraacg tcggccacta caacttcggt gccttcaacc cgggcccgtc gaacacgggt 1200

accttcraaca cgggcggcgc caacaccggc tggttcaaca caggaagcat caacaccggc 1260 

gccttcaaca taggcgacat gaataacggc ttgttcaaca cgggcgacat gaacaatggt 1320 

gtctttitacc gtggtgtggg ccaaggcagc ctgcagttcg ccatcaccag ccctgatttg 1380

acgctfcccgt ctctggaaat acccggaatc tcggttcccg cgttcagcct gcccgcgata 1440 

accttgccgt cgtfcgacgat tccggcggtg acgacgccgg ccaacgttac cgtgggtgcg 1500

tttgatttgc cggggttgac ggtgccgtcg ttgacgattc cagcggcgat gacgccagct 1560 

aacatcracgg tgggtgcgtt tgatttgccg gggttgacgg tgccgtcgtt gacgattcca 1620 

gctacaacga caccagccaa catcacggta ggtgcgttta acttgcctca gttgagtatt 1680

ccgtcggtga cggtfcccgcc gatcacgatt ccggctggca cagcgctagg tgcgttcaat 1740 

ctgccgacgc tgagtattcc gtcggtgacg gttccgccga tcacgattcc ggctggcacc 1800

actgtcggcg gatttacgct acccacgata cacaccccgt taataagtac accccaaata 1860

agtataggcg gctttagcac tcccggcata gccacgcaag caaattctgg tgtcatcaat 1920 

cttcccacct ttagccttaa cggcattacg ataactaatt tggtggtgtt cattccgaac 1980 

aacatcactg ccttgcaaac caatatgccc ggggtattcc cgcagattgg cggcttcgct 2040 

aatacacctc ctgcctttat taatactggg accattaccg tgggtggagg tcaaatcaac 2100

ggcgtcggct tctcgatcgg cgcaatcaac gtcaccccct tcaccctccc caacgtcgtc 2160 

atccaaccgt ggtccctcgg ggggatctcg gtcgacgggt tcaccctgcc agagatcagc 2220 

acccaagaat tcaccactcc ggcgttgacg atcagtccga ttggtgtcgg tgcattgagc 2280

ctgccggata tcactactca acagttcacg accccggagt tgaccatcga cccgatcacg 2340

ctgggtgggt ttacgctgcc gcagctcagc atcccggcga ttaccacccc ggcgttcacg 2400

atcgatccga tagcgctggg tggtttcacg cttcctcaga tcatgacgcc cgagataacg 2460

actccaccgt tcgccatcga cccgatcgga cttagcggtt tcaccctccc ccaggtcaat 2520

atcccggaga tcaccacgcc agagttcacc atccagccgg tgggcttggc ggccttcacc 2580 

acacccgcac tcaccatcgc cagcatccac ctgccgagca ccaccatggg cggattcgca 2640 

atcccagcgg ggccgggata cttcaactcg agcgcaacgc cctcgttggg ctttttcaac 2700

gccggaatcg gtgggaactc gggcttcggc aacagcggct cgggactgtc gggttggttc 2760 

aacacaagtc ctgttgggct gctagccggc tcgggctacc agaactacgg tggtcttatc 2820

tccggcttct ccaaccttgg cagcggcata tcgggcttcg ccaacaccgg caccctgccg 2880

tttgccgtga ccagcttggt ctccggtttg gccaacatcg gcaacaacct gtcgggcctg 2940

ttcttccaga gcaccacgcc ataa 2964
<212> Type : DNA 
<211> Length : 2964 

SequenceName : SEQ ID 537
SequenceDescription : 


Sequence 
<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgtcgtttg tagtcgtggc gccggaggtg ttggcggcgg ccgcttcgga tctagcgggc 60 

atcgggtcga cactggccjca ggccaacgcc gcggcgttgg cgccgaccac cgcggtgttg 120

gccgcgggtg ctgatga^gt ttccgcggca atcgcgtcgc tgtttggggc gcatggtcag 180

gcgtatcagg cggtgagcgc ccaaatgtcg gcgtttcacg cccagttcat gcaggcgttg 240 

acgggtgccg gcggggcfcfca tgcggctgcg gaggcggtca acgtctcggc ggcgcagagc 300 

gtggaacaag acctgttggc cgcgatcaac gctcgcttcg agcggatttt tgggcgcccg 360 

ctgatcggtg atggcgccaa cggcgggccg ggacaagacg gcgggcccgg cgggttgctg 420 

tacggcaacg gtggcaacgg cggcaccagc acgaccgtgg ggatggccgg cggcaacggt 480

ggtgccgccg ggctgatcgg caacggtggg ttcgggggcg gcggcgggcc cggcgcggcc 540 

ggcggcaacg gcggcgccgg cgggtggcta ttcggcaacg gcggcgccgg cggtgccggc 600 

ggcctcggcg tagcgcccgg cgtgcccggc ggcgccggcg gtgccggcgg cgccggcggt 660 

gtcggcggac ccgccgggtt gtggggccac gggggtgccg gcggggcggg tggtgccggc 720

gtggctggcg ccggcggctt cgaggggacg atcggtgccg gcggtgccgg cggtgtcggc 780

ggtgccggcg gtgtcggcgg tgccggcggt gccggcgggt ggctgtacgg cgacgccggt 840 

gccggtgggg atggtggfcgt cggcggtgcc ggcggcaccg gcgggttagg caaccgtggc 900 

ggcgccggtg gcgccggggg cgccggtggt gtcggcggcg ccgggggtgc cgccgggctg 960 

tggggcggcg gtggtgccgg cggggtgggt gggaccggcg gcggcgccgg cctcggtgct 1020 

cagagcgtca ccttcagtag tagcttaagt ggcctttccg gtggcgacgg cggcgccggc 1080

ggggccggtg gcgccggfcgg cgccggtggc accggtgggt ggctgtatgg cggcggtggt 1140 

gccgccggat ccggcggcjga cggtggtacc ggcggtcagg gcggcgccgg cggcgccggt 1200 

gtatttagcc tattcgga/tc cggtggcggc cccggcggca acggcggcgt cggcggcgtc 1260

ggcggtgtcg gcggtgcfcgg cgggcgtgcc ggcttgttcg gcgtcggggg cctcggcggc 1320 

gcgggtggcg acgccggtiga ctccggcgaa ggcggcttcg gcgggccggg gctcgccggc 1380

gggctgttcg gcaaccccgg caacggcggc gtcggcggga tcggcggcga cgccgcagcc 1440 

ggcggcgccg gtggggccgg aggcaacggt ggggccggag gcaacggtgg gtggttgttc 1500

ggcaatggtg gtgccggcgg ctccggtggc gacggcggcg ccgccggccg tggcggtgcc 1560 

ggcaacttgg gctcggccgg gggtatcaac gcccccgccg gtaaccccgg cagcggctcg 1620 

gtcggcatcg gcggtgccgg tggtgccggc ggcaccgccg ggctgttcgg cgacggtggg 1680

gctggtgggg ccggtggtgc cggcgccgcc ggcggcttcg gcggcatcag cgccgccacc 1740 

ccctcggcgg gcagtgaggg cgccatgggt ggggccggtg gtgttggcgg caacgccagg 1800

ctgttgggca ctggtggcgc cggtggagtc ggcggcggcg gcggggccgg cggcgacgga 1860 

ggccgcggcg gagtcgcaac ccccggcggt cagggcggtg acgctgggga cggtggcgcc 1920 

ggcggggccg gcggcaafcgg cggcggcgcc agcggcgccg gcgggtggct gttggggacc 1980

ggtggtgccg gcggtgccgg tggtaacggc ggcaatggcg gaaaagccgg ttttagccct 2040 

gggccgacca acttcggfcct caacggcgcc ggtggtggtg gtggtgtcgg cggcaacggc 2100 

gccaccggac cctggctcgtt cggcgacggc ggccccaccc caggcagcac cggtgccggt 2160 

gcggccggtg gtcacggcgg cgacgcccag ctgatcggca acggcggcca cggcggggcc 2220 

ggcggcaccg gggtgccgaa cgggtcaggt ggtgccggcg gcctcagcgg gctgctgttc 2280

ggcgagccgg gggcgaacgg gtag 2304
<212> Type : DNA 
<211> Length : 2304 

SequenceName : SEQ ID 538 

SequenceDescription :

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv

<400> PreSequenceString :

atgtcgtttg tgatcgccjaa ccccgagatg ctggcagcgg cggcgaccga tttggccggc 60 

atccggtcgg cgatcagcgc cgcgaccgcg gcggccgcgg ccccgacgat ccaggttgcc 120 

gcggccggcg ccgacgacjgt gtcgctggcc atctcggcgc tgtttggcca gcacgcccag 180

gcctatcagg cgctcagcgc ccaggcgacg atctttcacg accagttcgt gcaggccctg 240 

acctccggcg gcaacctgta tgcggccgcc gagagccaca ccgtcgagca gatggtgctc 300

aacgcgatca acgcgcccac ccagacactg ttcggccgcc cgctgatcgg cgacggcgcc 360

aacgggaccg cggagaaccc ggacggccaa aacggcggcc tgctgttcgg caacggcggc 420 

aacggcttta cccagacgac cgccggggtg gccggcggca acggcggcag cgcggggttg 480 

atcggcaacg gcggggccgg cggcggcggc ggggccggcg ccgccggcgg cctcggcggc 540 

aacggcgggt ggctgtacgg caacggcggg gccggcggca tcgggggcgc gggcaccgga 600

accggtggtc acggcggggc cggcggggcc ggcggccggg cctggctgtg gggcaccggc 660 

ggggccggcg gagccggcgg tgacggcggc tggttgttcg gcgacggcgg ggccggcggc 720

accggcggca acggcggcag cggctttaac agcttgacct cttcggtcgg cggcgccggc 780 

ggggccggtg ggcacgccgg gctgttcggc gccggcggga ccggcgggac cggcggcatc 840 

ggcgggcaaa acaccgagac cggcccggcc gccagcaacg gcggcgcggg cggcgccggt 900

ggcggcggcg ggtacctggt cggcgatggc ggcgccggcg ggaccggcgg ggccggcggg 960

aagaattcca gcggtggcgc caccctcacc gggggcaccg gagggaccgg cggggccggc 1020 
ggggcggccg ggtggctcta cggcagcggc ggcgccggcg gtgccggcgg cgccggcggg 1080

ctcaacaacg ccggtgjcftgc caccggcggc accggcggta ccggcggagc cggcggctct 1140 

ggagcgtggc tgtaccggcaa cggcggggcc gccggggccg gcggcaacgg cggcaacaat 1200

accagcgccg gcaccggtgg tgtcggggct agcggcggga ccggcggaaa cgccgggctg 1260

atcggcgccg gcggccacgg cggggccggc ggcgccggcg gaaaccaaac cggtggcgtg 1320

ggcaacggcg gggccggcgg gaacggcggc gccggcgggg ccggtggtca gctgtacggc 1380

aacggcgggg acggcggcaa cggcggggcc ggcggggcca acatcgccgg cggcaatggc 1440 

agcgacggcg gcgccgccgg ccacggcggg gccggcggga gcgcccggct gatcggagcc 1500

ggcggccacg gcggggacgg cggcgccggc gggaacaccg ccggcagaag ggccgacgcg 1560 

atcgccggca ccggcgggga cggcggcaac ggcgggaatg gcggcttgct aagcggcaac 1620

gccggggccg gcggccacgg cggggcgggc gggagcagca ccgcgaccac caccaccgga 1680

acacccccaa cgggtgcaac gggcggcaat ggcggcaacg gcggggccgg cggcacggcc 1740 

gggtttaccg gcagcggcgg catcggcggc aacggcgggg ccggcggcac cggcggtaac 1800 

gccggtgtcg ccttgtcggt tggcagcacg ggcggactgg gcggtaacgg oggcagcggg 1860

ggcctcggcg gcggcggcgg gtcgctcttc ggcaatggcg gggccggcgg tgtcggcgca 1920

accggcggaa acggcggaag cggtatcggg cccgccagcg tgggtggcaa cggcggcaag 1980 

ggcggcgttg gtgcggccgg cgggcttgcc gggcagatcg gcaacggcgg tagtggtggg 2040

tccggcggtg ccggggcjcaa cggcgggacc ggcgataccg ccggcaacgg tggcaatggt 2100 

ggtgccggcg cggtcgcjcgg caacgcccag ctcatcggca acggcggcaa cggcggtggc 2160 

ggcgggaacg gcggaaccgg cgccgacggc acctaa 2196



<212> Type : DNA 

<211> Length : 2196

SequenceName : SEQ ID 539
SequenceDescription : 


Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgccggggc ggttcacjaaa cttcggtagc caaaacctgg gtagcggcaa catcggcagc 60

accaacgtgg gcagcggcaa catcggcagc accaacgtgg gcagcggcaa catcggcgac 120 

acgaacttcg gtaacggaaa caacggcaac ttcaactttg gtagcggcaa taccggcagt 180

aacaacatcg gcttcgcfaaa caccggcagc gggaatttcg gtttcggaaa cacgggcaac 240 

aacaacatcg gtatcgcfcjct caccggcgat ggtcagatcg gcatcggcgg actgaactcg 300

ggcagcggaa acatcggrttt cgggaactcc ggcaccggaa acgtcggttt gttcaactcc 360

ggcaccggca acgtagcjctt cgggaactcc ggtactgcga acactggatt cgggaacgcg 420 

ggcaacgtca acaccggatt ttggaacggc ggcagcacaa acactggcct cgctaacgcc 480 

ggcgccggca acacaggctt tttcgacgct ggcaactaca acttcggcag tcttaacgcc 540 

ggaaacataa actcgagttt tgggaattcg ggtgacggca acagtggttt cctcaatgct 600 

ggcgacgtca actccggtgt gggcaatgcg ggtgatgtca acactggctt agggaactcg 660

ggcaacatca atactggtgg gtttaatccg ggcacgctca acacgggctt cttcagcgcg 720 

atgacccaag ctggtccgaa ttcgggcttc ttcaacgccg gtaccggtaa ctctggtttc 780

gggcacaacg acccggctgg cagtggcaac tcgggcattc agaactcggg cttcggcaac 840

tcgggctatg tcaataccag caccacaagc atgttcggcg gtaactcagg ggtgctcaac 900

acgggctacg gcaactcagg tttctataac gcggccgtca acaacaccgg gatttttgtg 960

accggcgtga tgagttccggg atttttcaat tttgggacgg gcaactcggg cctgctggtc 1020

agcggcaatg ggctttccjgg tttcttcaag aacttgttcg gatag 1065 
<212> Type : DNA 
<211> Length : 1065

SequenceName : SEQ ID 540

SequenceDescription : 



Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgtcgttcg tgctggcgat gccggaggtg ttggggtcgg cggcaacgga tctggccgct 60 

ctgggctcgg tgctgggcgc ggccgatgcg gccgcggcgg ctacgacgac gggcatcgtg 120

gccgcggccc aggatgagjgt gtcggcggcg atcgcggcgt tgttttccgc ccacggccgg 180 

gcctatcagg tggccagfccgc gcaggcggcg gcggttcacg cccagttcgt ggaggcgttg 240

agcgcgggtg cgggggccta cgccagcgcg gaggccgccg gcgcggcggt gctggccaac 300

ccggcgcaga gcgtgcagjca ggacctgctg gccgccgtca atgcgcaaag tgtcgcgctc 360 

acggggcgcc cgttgatccjg caacggcgcc aacggggccc cgggcacggg ggccaatggt 420

gcgccgggcg ggtggttcfct cggtaatggt ggggccggcg ggtccgccgc cgctggctcg 480

ggcctgcccg gcggggccgg cggggccgcc gggttgttcg gcaccggcgg ggctggtggg 540

gccggcggga gttccacggt aggtgatggc gaggccgggg gtgccggtgg atcaggtggc 600 

tggttgttgg gcaccggtgg ggtcggcggg gtcggcgggc tcggggccgg cgccggtggg 660 
gccggcgggg ttggtggggc cggcgggctg ttgggtgctg gcgggcacgg cggcgccggc 720

gggctaggcg ccgtcaccgg tggggtcggg ggaactggcg gagccggtgg gctgctggcc 780

gggctgctgg ccgggccggg cggggccggc gggaccggcg gacgtggcfct tctcaacaac 840 

ggtggggtcg gtcjgggctgg cggcaacgcc gggctgctgt tcggtgccgg cggcaccggt 900 

ggatccggcg gagccggcct aggtggtgac ggtggggccg gtggggccgg cggcaacacc 960

ggtgtgctgt tcggcaacgc cggatccggg gggaccggcg ggttcggcga taccgacggg 1020

ggagccggcg gtgccggcgg tgacgccggc tggttgggct ccggtggggt cggcggggcc 1080

ggcgggttcg gcgaaaccgg tgacgggggt gtcggcgggg ccggcggcaa ggccgggttg 1140 

ctgatcggta accjgcggggc cggcggcgcc ggtgggcaag gcgccgfcgac cggcggtacc 1200 

ggcggggccg gcggcgacgg ggtgctgatc ggcaacggcg gcaacgccgg catcggcgga 1260

accggaccga cccjcgggtga taccggcgcg ggtgggatca gtgggctgct gctgggcgcc 1320

gacggcttca acaccccggc cagcgcctct ccgctgcaca ccctgaaaca acaggcgctg 1380

gccgcgatca acgcgccgac ccagacactg accgggcgac cgctgatcgg caacggcacc 1440 

cccggggcgg tcggcagcgg ggccaccggg gcccccggtg ggt.Cfgcfcgct cggcgacggc 1500

ggggccggcg ggfcccggcgc ggcgggctcg ggcgcgcccg gcggggcggg cggggctgcc 1560

gggctgtggg gtaccggcgg ggccggcggg gccggaggca gctcggcggg tggcggcggg 1620

gccggtgggg ccggcggggc cggcggctgg ctgctcggcg acggcggggc cggcgggatc 1680 

ggcggagcca gcaccgtact cggcggcacc ggcgggggag gcggggtcgg tgggctgtgg 1740 

ggcgccggtg gggccggcgg ggccggtgga accggccttg ttggtggcga cggcggggcc 1800

ggtggggccg gcgggaccgg cggactgctg gccgggctga tcggtgccgg cggaggtcac 1860

ggcgggaccg gcgggctcag cactaatggc gacggcgggg ttggcggggc cggcgggaat 1920

gccggaatgc tcgccgggcc gggcggcgcc ggcggagccg gcggtgacgg cgaaaacctg 1980 

gacaccggtg gggacggcgg ggccggcggt agcgcagggc tgctgttcgg cagcggcggc 2040

gccggcggcg ccggcggatt tggtttcctc ggtggggacg gcggggccgg tggcaacgcc 2100 

gggctgctgt tgtccagcgg cggggccggc gggttcggcg ggttcggcac cgccggtggg 2160

gtcggtgggg ccggcggcaa tgccggctgg ctgggcttcg gcggggccgg tggcgtcggc 2220 

ggcagcgccg ggctgatcgg caccggcggc aacggcggca acggcggcac cggcgccaac 2280

gccggcagcc ccggaaccgg cggcgccggc gggttgctgc tgggccaaaa cgggctcaac 2340 

gggttgccgt ag 2352

<212> Type : DNA

<211> Length : 2352

SequenceName : SEQ ID 541
SequenceDescription :

Sequence



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

gtgtcgttgg tgatcgcgac gccgcagctg ctggcaactg cggctttgga tttagcgagt 60 

attggttcgc aggtgagcgc ggctaatgcg gccgcggcga tgccgacgac ggaagtggtg 120

gctgcggctg ccgatgaagt gtcggcggcg attgcggggt tgttcggggc ccatgctcgg 180 

cagtatcagg cgctcagcgt acaggtggca gcgtttcacg agcagtttgt gcaggcgttg 240

actgcggccg cgggtcggta tgccagcact gaggccgctg ttgagcggag tctgctgggt 300

gcggtgaatg cgcccaccga ggcgcttttg gggcgcccgt tgatcggaaa cggcgccgac 360 

gggacggcac ccgggcagcc tggcgcggcc ggcgggttgc tgtttggcaa cggtggcaac 420

ggcgcggctg gcgggttcgg tcaaaccggc ggcagcggag gcgcggccgg gttgatcggc 480

aacggcggca acggcggggc cggtggtacc ggcgcggccg gcggtgccgg tgggaacggg 540 

gggtggttgt ggggcaacgg cggcaacggc ggtgtcggcg gcaccagcgt ggccgcaggc 600 

atcgggggtg cgggcggtaa cggcggcaac gccgggctgt tcggccatgg cggcgccggt 660 

ggtaccggcg gcgccggcct cgccggggca aacggggtca atcccacgcc cggccccgcg 720

gccagcaccg gggacagccc ggcagatgtg tccggcatcg gtgatcaaac cggcggcgac 780 

ggcggcacgg gcggccatgg cactgccggc acgccgaccg gtggcaccgg cggcgacggt 840 

gccaccgcga cggcaggctc gggcaaggcc accggcggtg ccggtggtga cggcggtacc 900 

gccgctgccg gtcjgcggcgg cggcaacggc ggcgacggcg gagtcgcgca gggcgacatt 960 

55 gcgagcgcct ttggcggtga tggtggcaac gggtccgacg gtgtagccgc cggcagtggg 102 0 

ggtggtagcg gccjgcgccgg aggcggcgct ttcgtacaca tcgccactgc cacctctacc 1080 

ggtggtagcg gcggtttcgg tggtaacggg gctgccagtg ccgcctccgg cgccgacggt 114 0 

ggcgcagggg gagctggcgg caatggtggc gccggcgggt tgctattcgg tgatggcggc 1200 

aacggtggcg ccggtggcgc gggtggtatc ggtggtgacg gcgccacggg ggggcccggg 1260 

60 ggaagcggcg gcaacgctgg catcgcgagg tttgacagcc cagaccccga ggcagaaccc 132 0 

gatgtggtcg gcggcaaggg tggtgatggc ggcaagggcg gcagcggcct tggcgtcggc 13 80 

ggcgccggcg ggaccggcgg cgcgggcggc aacggcggcg ccggcgggtt gttgttcggc 144 0 

aacggcggca acggcggcaa cgccggggcc ggcggggatg gcggcgccgg cgttgccggt 150 0 

ggggttggcg gtaacggcgg cggtggtggc accgcgacgt ttcacgaaga cccggtcgct 15 60 

65 ggtgtctggg cggtcggtgg cgtaggtggt gatggtggct ccggcggcag ctcgcttggt 1620 

gtcggcgggg tgggcggagc cggtggcgtg ggtggcaagg gtggcgccag cggcatgtfcg 16 8 0 

atcggcaacg gcggcaacgg tggcagcggc ggagtcggtg gggccggtgg agtcggcggg 174 0 
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gctggcggtg acggcggcaa cggcggctcc 
aactccatcg gcggggccgg cgggacgggc 
aacggtggcg ctggcggtat tgccggcggt 
gccgcaggag tcagcggcgc tgacggtatc 
ggcgcgggcg gtagcggcgg tgaggcaggc 
cctggcgttt ccggcaccga aggcatggcc 
<212> Type : DNA 
<211> Length. : 2085 

SequenceName : SEQ ID 542 
SequenceDescription : 

Sequence 



ggtggcaacg 
ggcaacgggg 
gcgggtgggt 
ggtggcgcgg 
gcggggggcc 
ggcgcgcccg 



ccagtacttt 
gcaacggcgc 
ccggagggtt 
gcggcgcagg 
tcaccaacgg 
gctag 



tggcgatgag 
aaacggcggt 
cctcagcggt 
cggtgccggt 
ccccgggtcc 



1800 
1860 
1920 
1980 
2040 
2085 



<213> O^ganismName Mycobacterium tuberculosis K3 7Rv 
<400> PreSequenceString : 

atgtcgtttg tgattgcggc gccagaggtt atcgcggcag cggcaacgga tttagccagt 60
ctcgagtcga gcatcgccgc ggccaacgcg gccgcggcgg ccaacaccac agcactgctg 120
gccgcgggtg ccgatgaagt ctcgacggcg gttgcggcgc tgttcggcgc ccacggccag 180
gcctatcagg cgctcagcgc ccaagcgcag gcgtttcatg cccagttcgt gcaggcgttg 240
acctccggtg gcggcgcgta cgccgccgcc gaggccgccg ccacctcgcc gctgctcgcc 300
ccgatcaacg agttcttcct ggcgaatacc gggcgcccgc tgatcggcaa cggcaccaac 360
ggcgcccccg gcaccggggc caatggcggg gacggcggct ggttaatcgg caacggcggc 420
gccggaggat ccggcgcggc cggcgtcaac ggcggggccg gcggcaacgg cggcgccggc 480
gggctcatcg gcaacggcgg cgccggcggc gccggcggga gggccagcac ggggaccggc 540
ggcgccggcg gcgccggcgg ggccgccggc atgctgttcg gggccgccgg ggtcggcggt 600
cccggcggat tcgcagccgc tttcggcgcc accggcggcg ccggcggggc cggcggaaac 660
ggcgggctgt tcgccgacgg cggggtcggt ggcgccggcg gggcaaccga cgccggcacc 720
ggcggggccg gcggatccgg cggaaacggc gggctgttcg gcgccggcgg caccggtggg 780
cccggcggat tcggcatctt tggcggcggc gccggcgggg atggcggctc tggcgggctg 840
ttcggcgccg gcggcaccgg cggtagtggc gggacgagca taatcaacgt cggcgggaac 900
ggtggggccg gcggcgacgc cggcatgctc agcctcggtg ccgccggcgg cgccggcggc 960
agcggcggat ccaacccgga cggtggcggc ggtgccggtg ggatcggagg cgacggcggc 1020
acgttgttcg gctccggcgg cgccggcggc gtctgcgggc tgggcttcga cgccggcggg 1080
gcgggcggcg ccggcggaaa ggcgggtctg ctcatcgggg cgggcggtgc cggcggcgcc 1140
ggcggcgggt ccttcgcggg cgccggcgga accggtgggg ccggcggcgc gcccgggctc 1200
gtcggcaacg ccggcaacgg cggcaacggc ggcgccagcg cgaacggcgc cggcgccgcc 1260
ggcggcgccg gcggaagcgg cgtgctgatc ggcaatggcg gtaacggtgg tagtggcggc 1320
actggcgcac ccgccggcac ggccggcgcc ggcggcttag gcgggcagct actaggtcga 1380
gacggattca acgcccccgc cagcacgcca ctgcacaccc tgcagcaaca aatcctcaac 1440
gcgatcaacg agcccaccca agccctcacc gggcgaccgc tgatcggcaa cggcgccaac 1500
ggcactccgg ggaccggggc cgacggtggg gccggcgggt ggttgttcgg caacggcggc 1560
aacggcggcc acggagcgac cggcgccgac ggcggagacg gcggatccgg cggggccggc 1620
ggaatcctgt caggtattgg cggcaccggc gggtccggcg gcatcggaac gacgggccaa 1680
ggcgggaccg gcgggaccgg cggagccgcc ttgctgatcg gctccggcgg caccggcggc 1740
agcggcgggt ttggcctgga caccggcggg gccggcgggc gcggcggtga cgccggcttg 1800
ttcctcggtg ccgcagggac cggcggccag gccgccctct cccaaaactt tatcggtgcc 1860
ggcggcaccg ccggagccgg tggtaccggt gggctgttcg ccaacggcgg ggccggcggg 1920
gccggcgggt ttggcgcgaa tggcgggacc ggggggaacg gattgctctt cggcgccggc 1980
ggcaccggcg gggcgggcac gctcggcgcc gacggcggcg ccggggggca cggcgggctg 2040
ttcggcgccg gcggcactgg cggtgctggc ggcagcagcg gcggcacctt cggcggcaac 2100
ggcggcagcg gcggcaacgc cggcctactc gccctcggcg cctccggcgg cgccggcggc 2160
agcggcggca gcgccttaaa tgtcggcggg accggcgggg tcggcgggaa cggcggcagt 2220
ggcggctcgc tcttcggctt cggcggtgcc ggcggcaccg gtggctccag cggcatcggc 2280
agcagcggcg gcaccggcgg tgacggcggt acggccgggg tgttcggcaa cggcggcgac 2340
ggcggcgccg gcgggtttgg cgctgacacc ggtgggaatt cgagctcggt ccccaacgcc 2400
gtgctgatcg gcaacggcgg caacggcggc aacggcggga aagccggcgg cacacccggc 2460
gccggcggca ccagcgggct gatcatcggc gagaacgggc tcaacggcct gtaa 2514
<212> Type : DNA
<211> Length : 2514

SequenceName : SEQ ID 543
SequenceDescription :

Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgtcgtttg tgatcgcggt gccggagact atcgcggcgg cggcgacgga tctagccgat 60
ctcggctcga cgatcgctgg ggccaacgcg gctgcggcgg ccaacacgac gagcctgctg 120
gccgccggtg ccgatgagat ctcggcggca atcgctgcgt tgttcggcgc gcacggccgg 180

gcctatcagg cggcgagcgc cgaggcggcg gcgtttcatg gtcggttcgt gcaggcgctg 240
accaccgggg ggggcgccta tgcggccgcc gaggccgccg ccgtgacgcc gctgctcaac 300
tcgatcaacg cgcccgtcct ggccgctacc ggccgtccgc tgatcggtaa cggggctaac 360
ggtgctcccg gcaccggggc caacggaggg gatgccggct ggttgatcgg caacggtggc 420
gccggcggat ccggtgcaaa gggcgccaac ggcggggctg gtggccctgg tggggccgcc 480
gggctgttcg gcaacggcgg ggccggcggt gccggcggaa ccgccaccgc caacaacggg 540
atcggcgggg ccggtggcgc tggcgggtcc gccatgctgt ttggggccgg cggcgccggc 600
ggcgccggcg gggctgcgac gfccfccttgtc ggtggcatcg gcggtaccgg cggaaccggc 660
ggcaacgccg gtatgctcgc cggcgccgcc ggggccggcg gtgccggcgg gttcagcttc 720
agcactgccg gtggggctgg cggcgccggc ggggccggtg ggctgttcac caccggcggt 780
gtcggcggcg ccggtgggca gggtcacacg ggcggggcgg gcggcgccgg cggggccggc 840
gggttgtttg gtgccggcgg catgggcggg gcgggcggat tcggggatca cggaacgctc 900
ggcaccggcg gggccggcgg ggacggtggg ggcggcggtt tgttcggcgc cggcggggac 960
99cggggcag gcgggtcagg actgaccacc ggcggcgctg ccggtaacgg tggtaacgcc 1020
gggacgcttt ccctgggtgc cgccggtggc gccggcggca ccggtggggc tggcggcact 1080
gtcttcggtg gtggtaaggg cggcgccggc ggagccggcg gtaacgccgg catgctcttc 1140
ggctccggcg ggggtggcgg caccggcggg ttcggcttcg ccgccggcgg gcagggtggg 1200
gtcggcggca gcgccggcat gctcagcggc tccggcggct ccggcggtgc tggcggctct 1260
gggggccccg cgggcaccgc cgcaggtggg gcgggtgggg cgggtggggc gcccgggttg 1320
atcggcaacg gcggcaacgg cggcaacggc ggcgagagtg gcggcaccgg tggtgtcggc 1380
ggggccggtg gaaatgccgt gctgatcggc aacggcggcg agggcggcat cggcgcgctc 1440
gccggcaagt ccggcttcgg cggcttcggc gggttgctgc tgggcgccga cggatataac 1500
gctcccgaga gcacctcgcc atggcacaac ctgcagcagg acattctcag tttcatcaac 1560
gaacccaccg aggcattgac cggacgcccg ctgatcggta acggcgacag tgggacgccg 1620
ggaaccgggg acgatggcgg tgccggcggc tggttgttcg gcaacggcgg caacggcggg 1680
gccggtgcgg ccggcaccaa cggcagcgcg ggcggcgccg gtggggcagg cgggatcctg 1740
tttggcaccg gtggcgccgg cggggccggc ggcgtcggaa cggcgggtgc cggcggggcc 1800
ggtggcgccg gcggatccgc cfctcttgatc ggttccggcg gtaccggtgg tgtcggcggg 1860
gccgccacca ccaccggcgg cgtcggcggg gccggcggga acgccggctt gctcatcggc 1920
gcggctgggc tcggcgggtg tggcggcggc gctttcaccg caggcgttac cactggcggc 1980
gccggcggga ctggcggcgc tgccgggttg ttcgccaacg ggggggccgg cggcgccggc 2040
gggaccggca gcaccgccgg gggcgccggc ggggccggcg gggccggcgg gctgtacgcc 2100
cacgggggaa ccggcggacc aggtgggaac ggcggctcca cgggggccgg agggacaggc 2160
ggtgccggcg ggcccggtgg gctgtacggt gccggcggct ctggcggggc cggcggtcat 2220
gggggcatgg ccggcggggg tggcggtgta ggcggcaatg ctggctcgct caccctcaat 2280
gcgtcgggcg gtgccggcgg cagcggcggc tccagcctgt caggcaaggc cggtgctggc 2340
ggcgccggcg gcagcgcggg afctgttctac ggctccggcg gggccggcgg caacgggggc 2400
tacagcctca atggcactgg cggtgatggc ggcaccggtg gggccggcca aatcaccggc 2460
cttcgcagcg gcttcggcgg cgccggcggc gccggcggcg ccagcgatac cggcgccggc 2520
gggaacggcg gcgccggcgg caaggccggg ctgtacggca acggcggtga cggtggcgcc 2580
ggcggggatg gcgccaccag cggcaagggt ggagccggcg gcaacgccgt ggtgatcggc 2640
aacggcggca acggcggcaa tgccggaaaa gccgggggca cggcgggtgc cggcggcgcc 2700
ggtgggctgg tactcggccg ggatgggcag cacggcttga cgtag 2745
<212> Type : DNA
<211> Length : 2745

SequenceName : SEQ ID 544
SequenceDescription :

Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgtcgttgg tgatcgtaac cccggagacg gtggccgcgg cggcctcgga tgtggcgcgc 60
atcgggtcat cgatcggtgt agccaacagc gcggcggcgg ggtcaaccac cagcgtgctg 120
gccgcgggcg ccgatgaggt gtcggcggcg atcgcgacgc tgtttggcag ccatgctcgg 180
gagtatcagg cgatcagcac gcaggtggca gcgtttcatg accgatttgc gcagacgtta 240
agcgccgcgg tcggctcgta tgtcagcgcg gaggcgacca acgccgcacc gttggcgacg 300
ctggagcaca acgtgctcaa tgccctcaat gcgcccaccc aggcgttgct gggtcgcccg 360
ttgatcggtg atggggcggc tggagcaccc ggcaccgggc aggccggcgg ggccggcggg 420
atcttgtggg gcaacggtgg ggccggcggc tcgggcgcgc ccggccaagt cggcggggcc 480
ggcggggccg ccgggttgtt cggcaccggc ggggccggtg gggccggtgg ggccggcgcc 540
gccggtgggg ccgggggtag cggcggctgg ctgctgggca atggtggagt cggcggggcc 600
ggcgggcaga gcctgctggg cggggcaacc ggcggggccg gcggcaacgc cggactgttc 660
ggggtcggcg gaaccggcgg gcccggcggg cccggcgggc ccggcggggt cggcggtacc 720
ggtggtgccg gtggcctggg cgggaccctc tacggggccg gcgggcacgg aggtgccggc 780
gggcccggcc cgatcggtgg tgtgggtgga cacggcggtg tcgggggtgc ggccgggctg 840
ttgggcgtgg gcgggcacgg tggtgccggc ggacacggcg cggagggtgt ggccggcgca 900
gccggtgagg acttgtcccc gcacggtacg tccgcjtgggg tcggcggcga cgccggcgat 960
ggcggcaccg gagggcgggg cggctggctg gccgcjcgccg gtggggccgg cggggccggt 1020
ggggttggcg ggaccggcgg ggccggcggg gccggatttt ctcgtgcctt gattgtcgct 1080
ggggataacg gcggtgatgg tggtaacggc gggafcgggcg gggctggcgg ggctggcggc 1140
cccggcgggg ccggcggcct gatcagcctg ctggcjcggcc aaggcgccgg cggggccggc 1200
gggaccggcg gcgctggtgg cgtcggcggt gaccgcggag ccggcggccc cggcaaccag 1260
gccttcaacg caggtgccgg cggggccggc ggccatggcg gtgaccccgg cgccggcggg 1320
gccggaggca ccggcggagc cggctccatc accgcjcgctc agggcgccat cggcgccacc 1380
cccaccagcg gcggcaacgg cggggccggc ggcaacggcg ccaacgccac caccgcgggc 1440
accaacggcg ccaacggcgcj acccggcggc catgcjcgggc tggtcggcaa cggccjgcgcc 1500
ggcggcaacg gcgccaacgg cgccgcaggg acgaacgcga gcgattcggg cgcagtcggt 1560
ggcaaaggca atagcggcgg caacggcggc caggcjcggcg ccggcggcga cggcggaact 1620
ctcgccggca atggcggggc cggtgggact ggcgcgccgcg gcgcagacgg cgggctcggt 1680
ggtagtggtg ccgagggtgc caatgccacc accgcgggcg agcgtggcca ggacggtggt 1740
aaaggcggca acggtggggt cggcgggacc ggcggcaacg ccgtcgcacc cggtgccaac 1800
ggcggccatg gcggcaacgg cggcaatcct ggcttcagcg gcgctggcgg gctcgggggg 1860
ctcagcggtg acggcgtgac ccgcgccgcg caggcjtgcga cccccgactt tgcagacacc 1920
gggggcaaag gcggcaacgg cggtaacggc gccaacgccg ttgcacccgg cggcaccggc 1980
gccagtggcg gagccggcgg gaacgctggg gctggcggca agggcgggga aaacatcatc 2040
ggcgatggcg gcggcggaaa cggcggggcc ggcgcjcaagg gcggcgccgg caccctgcta 2100
ggtctcacgg tatttggcga caatggcggc gccgcjcgtcc tcggcgactc gacggaccca 2160
gatggcagcg gcggtgctgg tggcgcgggc ggcgccggtg gcgccggtgg cgatccaacc 2220
atctga 2226
<212> Type : DNA
<211> Length : 2226

SequenceName : SEQ ID 545
SequenceDescription :

Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgtcgtttg tcacggcagc tccagagatg ctggcgacgg cggcgcagaa tgtcgcgaat 60
atcggcacat cgctgagtgc ggcaaacgcg acggcagcgg cgtccacgac ctcggtgctg 120
gcggccggag ccgacgaggt atcgcaggct atcgcaaggc tgttcagtga ttacgccacg 180
cactatcagt cgctgaacgc tcaagccgcg gcatfctcatc acagcttcgt gcaaacgttg 240
aacgccgccg gtggcgccta ttcgagcgcc gaggcggcca acgcttcggc gcaggcgttg 300
gaacagaatc tgttggccgt gatcaatgcg cccgcccagg cgttgttcgg gcgtcccctg 360
atcggcaatg gcgcgaatgg aacagcggcc acjccccaacg gcggtgatgg tgggattttg 420
tacggcaacg gcggcaacgg cttctcccaa acgaccgccg gggtggccgg cggcgccggt 480
ggttccgcgg gcctgatcgg caacggcggc aatggtggcg ccggtggggc cggtgctgcc 540
ggcggggccg gcggcgccgg cggatggctg ctcggcaacg gtggcgccgg cggtcccggc 600
ggcccaacgg acgttcctgc cggcacaggt ggagccggcg gggccggcgg cgacgcccca 660
ttgatcggct ggggcggcaa cggcgggccc ggcggtttcg ctgcttttgg aaacggtggg 720
gccggcggca acggcggcgc cagcggttcg ctctfctggcg tcggcggcgc cggcggcgtc 780
ggcggatcga gcgaagacgt cggcggcacc ggcgcjggccg gcggcgctgg ccgcggtcta 840
ttccttggcc tgggcggtga tggcggcgcc ggcggcacca gcaacaacaa cggcggtgac 900
ggtggcgccg gcggcaccgc gggaggtcga ttgttcagcc tgggcggtga cggtggcaac 960
ggtggtgccg gtaccgcaat cggatccaac gccggtgacg gtggcgccgg cggtgacagc 1020
agcgccctga tcggctacgc ccagggcggc tccgcjcggcc tcggcggctt cggcgaaagt 1080
accggcggcg acggcggcct gggcggcgcc ggcgctgtgc tcatcggcac gggcgtcggc 1140
ggtttcggcg gcctcggtgg cggctccaac ggcaccgggg gcgcgggcgg cgcgggcggc 1200
acgggcgcca cgctgatcgg cctgggcgcc ggcggoggcg gcggcatcgg cgggttcgcc 1260
gtcaacgtgg gcaacggcgt cggcggtctg ggcggccagg gcggccaggg cgccgcgctg 1320
atcggcctgg gcgccggcgg tgccggcggt gccggjcggcg ccacagtcgt tggacttggt 1380
ggcaatggcg gtgacggcgg tgacggtggc ggcctgttta gtatcggcgt cggtggggac 1440
ggcggcaacg ccggcaacgg cgccatgcct gccaatggcg gcaacggcgg caacgccggg 1500
gtcattgcca acggctcctt tgccccgtcg ttcgtcggct tcggcggcaa cggcggcaac 1560
ggcgtcaatg gcggcaccgg cggcagcggc gggatccttt ttggcgccaa cggcgcgaac 1620
ggaccgtcgt ag 1632
<212> Type : DNA
<211> Length : 1632

SequenceName : SEQ ID 546
SequenceDescription :

Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgtcgtatg tattggcgac gccggagatg gtggcagcgg cagcaaacaa tttggcgcag 60
atcggctcga cgttgagcgc ggccaatgcc gcggcgctgg ccccgaccac cggagtgctg 120
gccgcgggcg ccgacgaggt gtcggcagcg gtggcgtcgc tgttttccgg gcacgcccag 180
gcctatcaga cactcggcac gcaggcggcc gcgtttcatg aacggtttat ccaggccttg 240
agcacggctg cgggcgcata tggtagcgcc gaggccgcaa atgcctcccc gctgcagcaa 300
gcgttgaatg tgatcaacgc gcccacgcag acgctgctcg ggcgccccct gatcggcaac 360
ggcaccaatg gtgcgccggg taccgggcag gccggcgggc cgggtggctt gttgtatggc 420
aacggcggca atggcgggtc cggcggggtc ggccaagccg gcggagccgg cggcagcgcc 480
gggttgatcg gcatcggcgg gaccgggggg gccggagggg ccggcgcggt cggcggcgtc 540
ggcggcaacg gggggtggtt atacggcaat ggcggggccg gcgggctcgg ggggaccggg 600
gtggccggcg tcaatggcgg tatgggcgca gcaggaggtg ccggcggcaa cgcctacctg 660
ttcggttctg gcggcgcggg cgggcagggc ggtatggggg cagccggggc agacggcgtc 720
aaccccacac ccaccggcac cgctgatgcc ggcagtaccg gcaccgacca gacgctgggc 780
ggcaacgcca taggcggtaa cggtggcccc ggcgatgccg gcgacgcgat gacgtccggc 840
ggcgctggcg ggtctggcgg gaacgccgtc tcaaccgtca acggcgacgc cgtgggcggt 900
gagggaggca aagggggtga gggcgcctat ggcggcgctg gcggcgccgg cggcagcgcc 960
gcttccatcg gcaatgcagc cataggaggt aacggcgcjtg ccggcgggaa cgcccaggcc 1020
cccggcggtg tggggggcgc cggcggcgaa ggcggggatg cccaggtggg caccaattct 1080
cccagcaacg cggaagccgg taacggcggc agcggcggca acggcttcga cagcttcgct 1140
tccggcggta ccggcggagc gggcggaacc gggggcgccg gtggccgcgg cgggctgctg 1200
atcggcgacg gcggcgccgg cggcgcgggc ggagtcggtg gtaccggtgg tagcggtgcc 1260
cccggcggcg gcggcggcgc cgggggcgac ggcggtgccg ccaacaccga cagcgccggc 1320
agttcacgca aggcgttcgg gggcgatggc ggtgtgggcg gtgacggcgc gagcgccctc 1380
ggcaccggtg gcgaaggcgg tatcggcggc cagggcggta acgggggtgc tggcgggctg 1440
ctcatcggca acggcggcgc cggtggtgtc ggcggcacgg ccggtgccgg tggtactggt 1500
ggttccggtg gtgctggagg tgccggaggc gccggtggtg gcggcaccaa cagtggtccg 1560
ggcgcagcgt ttggcggtaa cggcaacacc ggcggcaacg gcggcaacgg cggcgccccc 1620
ggcgccctcg gcggcaaggg cgggtccggc gggctgattg gccgcgcggg cagcgacggt 1680
ggcgttggtg ccggaggagc gggcggcgcg ggtggccjccg gcggcacagg cggtgagggc 1740
ggcacgggcg gcgacgggaa aaccaccgac ggcaatcccg gcatgggcgg cagcccgggc 1800
agcgccggcc aacccggcta a 1821
<212> Type : DNA
<211> Length : 1821

SequenceName : SEQ ID 547
SequenceDescription :



Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgtcgtttt tgtttgcaca gccggaaatg ctgggcgjcgg cggcgacgga tctggcaagc 60
atcggctcgg cgatcagcac ggccaatgct gcggcccjcgg ccgccacgac gcgtgtgctg 120
gccgccggtg ccgatgaggt gtccgccgcc gtggcgcjcgc tgtttagcgg ccacgcgcag 180
acctatcagg cgctgagaac tcaggcggcg gcgtttcacc agcagatcgt gcagaccctc 240
acgagcacgg caggcgcgta tgccagcgcc gaggccgjcca acgtcgagca gcagctgctg 300
ggtgcgatca acgcgccgac catggcgctg ctggggcgcc cgctgatcgg ccacggcgcc 360
gacggggcgc cggggaccgg gcaggccggc ggggccggcg ggatcctgta cggcaacggc 420
ggcaacggcg ggtccggcgc caccggtcag gccggcggtg cgggcggggc ggccgggctg 480
atcggccacg gcggggccgg cggcctgggt ggcaccggcg cgtccggtgg tgccggcggg 540
gccggcggat ggctgtgggg caacggcggg gccggcggca atggcggggt cggggtggcc 600
ggcgaccccg gtggtgtcgg cggtgccggc ggtgccggcg gcgccgccgg attgtggggc 660
agcggagggt ccggcggcac cggcgggcaa ggggggQtcg gtggcggcaa gtccggcgac 720
ggcggcacgg ggggtatcgg cggcgccggt ggcggtggtg gctggctgca cggcgacggc 780
ggcgccggcg gacacggcgg gcagggcgga accggcgtca gctcaggagg caacggcggg 840
gccggcggca ccggcggcga cggccgcggc ctgtcgggca gcggcggggc cggcgggcgc 900
ggcgggcaaa ctggggttgg aggcaaagtc ggggagaata acttcggggg cgcgggcggt 960
gccggcggta ccggcgggct catcggcaac ggcggtgccg gcggcaacgg cgggcaaggc 1020
gcaattagcg gcgccggcgg ggccggtggc aatgcctggc tgatcggcga cggcggtgcc 1080
ggcggcaacg gcggtgatat ccgtggccag ggcggcggcg ccggtggagc cggcggcgct 1140
ggtgggcaac tcattggcaa cggcggcacc gggggg9cag gcgggaccgt caccagtccg 1200
aacggtcttg gcggtgctgg cggagccggc gggtccgccg gtctgattgg ccacggcggc 1260
accggcgggg ccggcgggca cagcgcccag gggccccjacg gcaatggcgg aattggtggt 1320
gccggcgggg ccggtggcaa cggcggacaa ctctacggca ccggcggcac cggcggcacc 1380
ggcggcaagg gcggcgacgg cttcggcgtg ttcggcaagg gcggcgccgg cgggaccggc 1440
gggcgaggcg gtgccgccgg cctgatcggc gacgccggga ccggcgggac cggcggaaag 1500
ggcggcaccg ccggcgagga cggtaccggc ggcaacggcg ggaccggcgg aaacggtggg 1560
gccgccgtcc taatcggcaa cggcgggggc ggcggcgccg gcggaaacgg tggggccggc 1620
aacgatggca cccccggcaa cggcgggggc ggcggcgtcg gcgggaccgg gggaaccctg 1680
ttcggccagc ccggccaacc cggtccgccc ggccagcccg gccccgccta a 1731
<212> Type : DNA
<211> Length : 1731

SequenceName : SEQ ID 548
SequenceDescription :




Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

gtgtggacgt cgcagatgat cgtggcgccg gcgtttgtcg atgcagcggc aaaggaccta 60
gcaactattg gttcggcgat tagccgggcc aacgccgaag cgttggtccc gataacggcg 120
ttactgcctg cgggcgctga cgacgtgtcg gctgcgattg cggcgctgtt cgcaacgcac 180
gggcaggcat accaggagct cagtgcccac gcggtcgcat tccatgagca gttcgtccag 240
ctcatgagcg cgggggcggc ccagtacgcc agcgccgagg cagccaactc gtcaccattg 300
caaatcgtgg gccaaaccgc cctcgatgcc atcaattcac ccgtgcagac gctgaccggg 360
cgtccgctta tcggcaacgg tgccaacggg gtcgcaggaa ccgggcaaaa cggtggcgat 420
ggcggatggc tatacggcaa cggtggcaac ggcgggtccg gcgggacggg ccaaaatggc 480
ggcaacggcg ggtcggctgg gctatggggt agcggcggca atggcggcca gggcggggcg 540
ggtgccaacg gcgcagccgg tcaacccggg aaagcgggcg ggtccggcgg caacggcggc 600
gccggtggat ggatctatgg tcacggtgga catggtgggg ccggcggaaa cgggggcaac 660
gccacagcgc ccgggggtgc gtcggcaggc ttcgatgggg gcgccggcgg aaacgggggt 720
tcgggtggtc gcggtggact gttgttcggc aacggcggca acggctcggt cggtggcatg 780
ggaggacaag gcactaatga cacagccgga gattcggccg gcagcggcgg attaggaggc 840
aacgggggca acggcgccca gggcgggtgg ctgatcggca acggggggca aggtggggac 900
agcggcgccg gcggcggcac cgactccact caaacgggcg tcatgaacgg cgcttccggc 960
ggttcggccg ggatagccgg taacggcggt gacgcgggcc tagtcggcaa cggcggggcc 1020
ggtggcaacg gcgggaatgg agcggccggg tctgcgctgg gtactaccat cttcggcggg 1080
agcggcgggg tcggtggctc aggcggcgac ggcggcaacg gcgggtggtt gttcggcagc 1140
ggcgcgtccg gcggcaacgg gggtcagggc ggtgacgcag gcaccaacgg atttgcgggc 1200
tttggcggct ctgctggcgg tggcggctgg gtaggtgccg ttaacttcgg accgattagt 1260
gtccagggtt ttgggttgtt tggtcacggc ggtgacggtg gcaacggtgg tgacgttggg 1320
gccggcagcc tcagcattca atttggtgca tcagggggtg acggcggcca aggtggggtg 1380
ctatacggca atggcggcaa cggtggtaac gccggcagcg gcggaggtac cggcttcgaa 1440
ggcagcgccg gccagggtgg cgccgccatt ctgatcggca acggcggggc cggagggaat 1500
ggggcgaccg gtgggaccgg agtaggcaac attattcagg aagccggagg tgacggcagc 1560
gacggtggcg cgggcggcag cggaggccta ctgttcggta gcggcggggc cggcggcatc 1620
ggcggggccg gcggcgtcgg cgggtcgggc aacgacggcg gcaacggcgg cgatggtggc 1680
caaggcgggg caagcggcct ggggatcggc aacggcggac ccgggggaag tggcggtacc 1740
ggtggggccg gcggaaccgg cggcagtgcg ggcactggcg gtgccggcgg tgacggtggt 1800
aacgccgccc tgctgatcgg taccggtggc gacggcggag atggtgtccc acccgcaccc 1860
gggggccaag gcggcaaggg cggattgatc ggtttgcctg ggcagaacgg gcagccgtag 1920

<212> Type : DNA
<211> Length : 1920

SequenceName : SEQ ID 549
SequenceDescription :

Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgtcatggg tgatggtttc gccggagctg gtggtggcgg cggcagcgga tttggcgggg 60
atcgggtcgg cgattagctc ggctaatgcg gcggcggccs tcaacacgac gggattgttg 120
accgcgggtg ccgatgaggt gtcgacagcg attgcggcgt tgttcggtgc ccaaggccag 180
gcctaccagg cggcgagcgc acaggcggcg gcgttttacg cccagttcgt gcaggccctg 240
agcgccggcg gaggcgcgta tgcggccgcc gaggccgccg ccgtgtcgcc gctgctggcc 300
ccgatcaacg cgcaattcgt ggcggccacc gggcgcccgc tgatcggcaa cggcgccaac 360
ggcgcccccg ggaccggagc caacggcggg cccggcgggt ggttgatcgg caacggcggc 420
gccggcgggt ctggcgcccc cggcgctggg gccggcggta acggcggggc cggcgggctg 480
ttcggcagcg gcggggccgg cggggcctcc accgacgtcg ccggcggggc cggtggggcc 540
ggcggggccg gcggaaacgc cggcatgctg ttcggcgccg ccggggtcgg cggcgtcggc 600
ggattctcga acggcggtgc caccggcggg gcaggcgggg ccggcggggc gggcgggctg 660
tttggcgccg gaagggaacg cggcagcggc gggtcgggca acctcactgg cggggccggc 720
ggggccggcg gcaacgccgg gacactcgcc actggtgatg gcggggccgg ccjggaccggc 780
ggcgctagtc gcagcggcgg attcggcggg gccggcggag ccggcggcga cgjccggcatg 840
ttcttcggct ccggcggctc cggcggcgcc ggcggcatta gtaaaagcgt cgjgggacagc 900
gccgccggcg gggccggcgg ggcccccggg ctgatcggca acggcggcaa cgjgcggcaac 960
ggcggcgcga gcaccggcgg cggggacggt gggcccggcg gggccggcgg caccggcgtg 1020
ttgatcggca acggcggcaa cggcggcagc ggcgggaccg gcgcgaccct ggjgcaaggcc 1080
ggcatcggcg gtaccggggg ggtgctgttg ggcctggacg gctttacggc ccccgccagc 1140
acctcgcccc tgcacaccct gcagcaggac gtgatcaata tggtgaacga ccccttccag 1200
acgctcaccg ggcgtccgct gatcggcaac ggcgccaacg gcactccggg gaccggggct 1260
gacggcggag ccggcggctg gttgttcggc aacggcggaa acggcgggca g^gaacgatc 1320
ggcggcgtca acggcggggc cggcggggcc ggcggggccg gcgggatctt gttcggcacc 1380
ggcggcaccg ggggcagcgg cgggcccggc gccaccggcc tcggcgggat tggcggggcc 1440
ggcggagccg ccttgctctt cggctccggc ggggccggcg gaagcggtgg tcjccggcgcg 1500
gtcggtggca atggcggggc cggcggcaac gccggtgcgc tcttgggcgc cgccggggcc 1560
ggcggggccg gtggtgccgg cgcggtcggt ggcaatggcg gggccggcgg taacggcggg 1620
ctgttcgcca acgggggagc cggcgggccc ggtgggtttg gcagccccgc tggggctggc 1680
gggatcggcg gggcaggtgg gaacggcggg ctgttcggcg ccggcgggac cggcggggcc 1740
ggcgggggaa gcaccctcgc cggcggcgcc ggcggggcgg gcggcaacgg cgggctgttc 1800
ggcgccggcg gcaccggcgg cgccggcagc catagcaccg ccgccggagt tfcccggaggg 1860
gccggcgggg ccggcggcga cgccggcttg ctctccctcg gcgcctccgg cggggccggc 1920
ggcagcggcg gttccagcct gaccgccgcc ggcgtggtcg gcggcatcgg cggcgccgga 1980
ggcttgctct tcggctccgg cggcgccggc gggagcggcg ggttcagcaa cfcctggcaac 2040
ggcggcgccg gcggggccgg cggcgacgcg ggtttgctcg tcggctccgg cggggccggc 2100
ggggccggcg cctccgccac cggcgccgcc accggcgggg acggcggggc cggcggcaag 2160
tccggagcgt tcggtctcgg aggtgacggc ggcgccggcg gcgccaccgg tttgtccggt 2220
gctttccaca tcggcggcaa gggcggcgtc ggcggcagcg ccgtgctgat cggcaacggc 2280
ggcaacggcg gcaacggcgg taacagcggt aacgccggga aatccggggg tgcacccggc 2340
cccagcggcg ccggcggcgc cggcgggctg ctgctcggtg agaacgggct gaacggcttg 2400
atgtag 2406

<212> Type : DNA
<211> Length : 2406

SequenceName : SEQ ID 550
SequenceDescription :

Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

ggccagagct atcaagcggt cagcgcccag gcggcggcgt ttcatgaccg gttcgtccaa 60
ctgcttaacg ccggtggagg ttcatatgcg agcgccgaga ttgccaacgc gcagcagaac 120
ctgctgaacg cggtgaacgc gcccacccag acgctgctgg ggcgtccgct ggtcggcgac 180
ggcgccgacg gggccagtgg tccggtggga cagcccggcg gggacggcgg coitcttgtgg 240
ggcaacggcg gcaacggtgg cgacagcacg agccccgggg ttgccggggg agccggcggg 300
tcagcggggc tgatcggcaa cggcggcagg ggcggcaacg gtgcgcccgg cggtgcagga 360
ggcaatggcg gcctgggcgg attgctgctg ggcaacgggg gtgccggggg agtcggcggt 420
accggtgaca acggtgtggg agacctcggt gctggcggcg ggggaggcga tsgcggtttg 480
ggtggacggg cggggctgat cggtcacggc ggtgccggcg gaaacggtgg ggacggtggg 540
cacggcggga gcggcaaggc cggcggcagt ggcggcagtg gcggcttcgg ccagttcggt 600
ggcgccggcg ggctgctgta cggcaatgga ggggcggccg gttccggcgg caacggtggc 660
gatgcaggta ccggcgtctc cagcgacggt ttcgccgggt tgggcggcag cggtggccgg 720
ggcggcgacg cggggctgat tggtgtcggc ggcggcggcg gcggcaacgg cggcgaccca 780
gggctcggtg cgcgcctgtt ccaggtaggt agtcgcggcg gcgacggtgg ggtcggcggg 840
tggctgtacg gcgatggcgg cggcggcggc gacggcggta atggggggct gccatttatt 900
ggctccacca acgccggcaa cggcggcagc gcgcggctca tcggcaacgg tggtgccggc 960
ggtagcggcg ggagcggcgc gcctggctcg gtcagcagcg gtggcgtcgg gggcgcgggc 1020
aaccccggcg gcagcggtgg caacggcggc gtgtggtacg gcaacggtgg cgccggcggg 1080
gccgccggcc aaggggggcc cggcatgaac accacctcgc ccggcgggcc 9jggcggtgtc 1140
ggcgggcacg gcggcaccgc catcttgttc ggcgacggtg gcgccggtgg cjgctggcgcc 1200
gccggcggac ccggtactcc ggacggggcc gcggggcccg gcggcagcgg cggcaccggc 1260
gggctgctgt tcggagtccc cggcccgtcc ggcccggacg ggtaa 1305
<212> Type : DNA
<211> Length : 1305

SequenceName : SEQ ID 551
SequenceDescription :

Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atggctcatt tttcggtgtt gccgccggag atcaactcgt tgcggatgta cctgggtgcc 60
ggttcggcgc cgatgcttca ggcggcggcg gcctgggacg gcjctggccgc ggagttggga 120
accgccgcgt cgtcgttctc cfccggtgacc acggggttaa ccgggcaggc gtggcagggc 180
ccggcgtcgg cggcgatggc cgccgcggcg gcgccgtatg cgggcttttt gaccacagcc 240
tcggctcaag cccagctggc tgccgggcag gctaaggcgg tggccagcgt gttcgaggcc 300
gccaaggccg cgatcgtgcc tccggccgcg gtggcggcca accgtgaggc gttcttggcg 360
ttgattcggt cgaattggct ggggctcaac gcgccgtgga ccgccgccgt tgaaagcctt 420
tacgaggaat actgggccgc tgatgtggcg gcgatgaccg gctatcacgc cggggcctcg 480
caggccgccg cgcagttgcc gttgccggcc ggcctgcaac agttcctcaa caccctgccc 540
aatctgggca tcggcaacca gggcaacgcc aacctcggcg gcggcaacac cggcagcggc 600
aacatcggca acggaaacaa aggcagctcc aacctcggcg cjcggcaacat cggcaataac 660
aacatcggca gcggcaaccg aggcagcgac aacttcggcg ccggcaacgt cggcaccgga 720
aacatcggct tcggcaacca gggccccata gacgttaacc fccttggcgac gccgggccag 780
aacaacgtgg gcctgggcaa catcggcaac aacaacatgg gcttcggcaa caccggcgac 840
gccaacaccg gcggcggcaa caccggcaac ggcaacatcg gtggcggcaa caccggcaac 900
aacaacttcg gcttcggcaa caccggcaac aacaacatcg gaatcgggct caccggcaac 960
aatcagatgg gcatcaacct ggccgggctg ctgaactccg gcagcggcaa tatcggcatc 1020
ggcaactccg gcaccaacaa catcggcttg ttcaactccg cjcagcggcaa catcggcgtc 1080
ttcaacaccg gagccaatac cctggtgcct ggcgacctca acaacctggg cgtcgggaat 1140
tccggcaacg ccaacatcgg cttcgggaac gcgggcgttc tcaacaccgg cttcgggaac 1200
gcgagcatcc tcaacaccgg cttggggaac gcgggtgaat taaacaccgg cttcggaaac 1260
gcgggcttcg tcaacacggg gtttgacaac tccggcaacg tcaacaccgg caatgggaac 1320
tcgggcaaca tcaacaccgg ctcgtggaat gcgggcaatg tgaacaccgg tttcgggatc 1380
attaccgaca gcggcctgac caactcgggc ttcggcaaca ccggcaccga cgtctcgggc 1440
ttcttcaaca cccccaccgg ccccttagcc gtcgacgtct ccgggttctt caacacggcc 1500
agcgggggca ctgtcatcaa cggccagacc tcgggcattg gcaacatcgg cgtcccgggc 1560
accctctttg gctccgtccg gagcggcttg aacacgggcc tgtttaacat gggcaccgcc 1620
atatcggggt tgttcaacct gcgccagctg ttggggtag 1659

<212> Type : DNA
<211> Length : 1659

SequenceName : SEQ ID 552
SequenceDescription :

Sequence

<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :
atgtcgttcc tgattgcttc gccggaggcg ctagcggcga cagccacata tttgacaggt 60
atcggttcgg caatcagcgc ggcgaacgcg gtcgcggccg ccccgacaac agagatcctg 120
gcggcgggga ccgacgaggt gtccaccgcc atctcagcgc tgttcggcgc tcatgcccag 180
gcatatcagg cgctcagcgc ccacgtggcg gcatttcacg accagttcgt gcataccttg 240
accgccggtg ccggctcata catggccgcc gaggccgccg ccgcctcgcc tctgcaggct 300
ttgcagctgg agctgctcaa cgccatcaat gcacccaccc tggcgctgtt gggacgcccg 360
ttgatcggcg acggcaccga tgcggcgccg gggagcgggg gggccggcgg ggccggcggc 420
atcttgatcg gcaacggcgg gaccggcggc gccagcgact tagccgggac cggccgcggc 480
ggggtcggcg gggcgggcgg cgccggcggg ctcttcggca tcggcggcgc cggcgggggc 540
tgcgggtccg cggtggcgat cgggggtgac ggcggggctg gtggcgccgg cggcgtgttc 600
agcggcggcg gcgccggcgg ggccggcgac gccatcgggg gtagcggcgg cgcgggcggc 660
accggtgggc tgttgggtgg tggcggcggc gcgggcggcg ccggcggcgc cggcggcaat 720
ggcgggggcg ccagcaacag cgcaagtatc gggggtgacg gtgggtccgg cggcgcgggc 780
ggcatgctct acggtgccgg cggcgtcggc ggcaacggcg gggccgcggt cgctatcggg 840
ggtgacggcg gggccggcgg cagggccgga gcgatcggca acggcggtga cggcggcaac 900
ggcgggactt ccaacacccc cggcggtagc ggcggcgacg gcggcaatgg cgggaacgcc 960
ggactgatcg gcaacggcgg taacggcggc aacgccgaga ttgtcatctc cggcggtagc 1020
gtcgccggca ccggtggcaa cggcgggttg ctgttgggct tcaacggcac gaacgggctg 1080
ccgtag 1086

<212> Type : DNA
<211> Length : 1086

SequenceName : SEQ ID 553
SequenceDescription :

Sequence

<213> OrganismName : Rickettsia prowazekii strain Madrid E
<400> PreSequenceString :

5 atgaaaaaat caaaaatttt aagaaaattt ttagcaacag cctcacta-tg tgggacatta 60 

ttcactaact ctaatgcaac aggaacaatt attcctaata atggtagcgt aagcttgaat 120 

actgacgcag gtcttgtagg aggagtattt aacaatggcg atattatcca aatagttaat 180 

993-gggcgtg aaattaaaat atcggcagat aaagcaaatg ctatcattgg aggaatcaat 240 

acattaaaag aactgcctga ttttggtggt gttgaagtaa gtcaaaacgt ctcaataggc 3 00 

10 cctcttaatg caggagaaga tcttaatact aattttggcc ctcttaaa-tt tattagtaat 3 60 

aatgttacgt caattattac aggagtcggt actaaaacat ttagtaafcat cgattttgct 420 

ggtaaaaatg ctactttaca aattaataaa gatttaaata ttacaactaa aatagataat 480 

acagtagccg gaaataatgg ttcaataaca tttgaaggta gcggtattat atcaaatcac 540 

ataggctaca ctaactctct tttaggaata aatgtaggaa acggagaa.gc> caagatttat 600 

15 gccccagaag caaataatat tacaattaat gctaaaaata taaatcttac tcacaataac 6 60 

tctatactta ctctttgtga cggtaacata actacattaa agggtaatat aaataatact 720 

acagaaattg acggtcaagg tatattaaat ctagcttatg atcttggtag tagtagcata 780 

ataacaggtg atataggtaa tataggctca ttagatacaa taaatgtttt acttggatct 840 

gcaacattta attctacaat attaaaagcg actaatatta atttaaaaca taatacttca 900 

20 acactcaatt tagatgataa cataattgtt attggtaata taaaaggtaa taataacaaa 960 

gatatattaa attttaaagt gcatggtact aatttagata acgaaatgat tattcctgct 1020 

cctcaaaaaa ctcatggtac attaaatttt aaaggaaatg ctacacttaa tgggaatata 10 80 

aataatttaa atatacttaa gtttagtgga ggtcacggta aaactttgaa tttacaaggt 1140 

aatactaaag tagataatct tgtttttgca gatagtgttt tagattcagg tactataagt 1200 

gttaacgggt tgttagatac agactgtgtg acatttaaca atagtaatgt taacggcgga 1260

acattaataa taaatgccaa gaacacaatc agtgcaaaat tattaaatgc tacaaaagca 1320 

aaaatacaaa tcaatgctaa tttaacgatg aatcatccaa gtgctggcjga tataagtgat 1380 

attagaatag cggataatac aatctataca atagatgcaa aaaatgggaa tgtaaatttg 1440 

ctaaataaca acgcaaagat catatttgaa ggagctgatt ctatgttaigc tttaatcaat 1500 

actggtgtta cagctgatag aacctttacc atatataata atttaaatca atctggcaat 1560

gatgaatatg gaatagtaaa aatagaagca attaaaaaag taataactat agcaaatcaa 1620 

agtggacctt atactatagg gcaggataat acacatcgtc ttaaggaatt aatagtagag 1680 

ggagcaggtg atatcataat agatgatacg atatttacga agttacttag tataaacagt 1740 

acaggacaaa taacatttaa tcgtacttta gatttaggtg caggtggtaa tattgcattt 1800

ggaaagcatg gtacattagt agtaaacggt gtcactggtt caattacaac ttctgaaaat 1860

aatcaaggaa tattaacaat caatagcggg aatattactg gtgttattgg cactaatgaa 1920 

ctcggcttaa aacttgtcaa tattggtgca gatcctgtta cgtgctcagc aaatgtattt 1980

gcatcggtag ctttaactaa cccaagttct gtgttaattt tagcagatgg tgttacgcta 2040 

actggtgaag taacaacaca taataataca aaaggtgtat tatcacttgg aacaggaagt 2100 

40 aatataacag gtcaaatcgg tactaatagc gcagctcttg agaaaataaa tafctggggct 2160 

ggagctagta atattgacag taatatatat gcaggttcta cagtactcac agatcaaaca 2220

tcggaattaa ctttaaacaa tgatgtagtc gttaacagta atatcataac tactgccggg 2280

aacaatagcg gaaagttaat atttacaggt aatggcggca taacaggaaa cataggagca 2340

aatggtgcag ctttacaaga ggttgtattt aacggtacta ctaatatagg tggtacagcg 2400 

aactcacaga actttactgt tgcacattct gcagcaaatg tggtgattac aggacttact 2460

actggtgcat taaaatacaa agatactggt acaattatag ctcatggagg attggtagga 2520

gacatcgatt ttaataataa agctggtaaa ttcattttag gcgatggtgc tatgattgat 2580

ggatcagtgt tatgtaatgg aggggttgct ggtacattgg attttafcagg tgacggtaat 2640 

gtaactcaaa atataggcgc agataatgca aatagtattt cgactatcaa cattcaaggc 2700

gataatacaa aaaacgtaac tatagcaaat gatatatttg tagataatat tcattttaca 2760

aacggtggga tattacagct tggcggaaat ctcacaacgc ataatattga ttfccggagca 282 0 

aatggtggta ctttagaatt taatggtaat aatacatata acttaaafcgc tattattgta 28 8 0 

aacggacaaa acggtatatt aaatgctttc acaaacttaa aggctagcga tgatactatt 2940

ggtacggtta aaataattaa tataggacaa ataggaacac cgcaaaactt tactattcaa 3000

gttaataaca aaaatttaac tctagtaagt agcgtaaata gtagcattaa ttttggtgat 3060

gctaattcgc aattaatatt atctgcacca gtagatcaaa ctattaaatt tattaataat 3120

ttaaacgaaa ctggaggtgg tattatcact ttagatagta atggtaataa tttaaccata 3180

agtggtaata atggaataaa gcttggtagt aaaggtaatg agttatctag cttaaatatt 3240

aaaggaaaag ttactgtaac taatgattta gatatacaaa atattcatca attaaatata 3300

aacaatggtg cattatttga tgaccaaagt ctcacatctg ctaaaatcaa aaatataaat 3360

attggtacag tagcaggcgg agctacttat actttagatg ctataaatga taattttgat 3420

ttaaatacta gtggtatggt atttaaacat caagattcaa tattagaact gaaaaatagt 3480

tcaaatacta atgaccacac tataacatta acatctgctt tagatccagg taataatcaa 3540

tttggtataa ttaaactcat aacagatact aataaattaa ctatagacaa taatggcaat 3600 

gtggcttata cactcggtac agcaaatcat atgttgaagc aattaacttt tgctagtata 3660

gataatgggg ctatagcttt aaaagtaggg atcaatgttg aaaatgttac cttaaatatt 3720 

aaggatatag agttaaatga ggtgaatgca aatgttctgt ttaacaaaaa cacaacatat 3780 
accgcaacag gtaatataaa cggtcatgta gatttccaag gtaatgcagg tgtaataaat 3840

cttaatgatg atatagaaat tgacggtagt gttacaagta ccggtaaLgu aaacggtaca 3 900 

ttaaatttca atggatcagg taaagtgact ggtttaatca ataacatagt aatgctgcaa 3960 

gcaggagcag gtgatgtatc actatctgct agcggtaatt attctattac tgaaattcaa 4020 

ggtaacggta ataataattt gacatttgct gctaattcac acttaacaac tgatattaat 4080

aagacaggcg gtcaggattt aaatttagta ttcataaatg gtggtagcgt tagcggttct 4140

atcggcgcaa atgcagcagt tggtgatatt attataaacg caggaagtgt aaatttcagt 4200

aatactctta aaagcggtaa tattgttata tcggatggtg ccacgatgca agttaataat 4260 

aacgtaactg ctactgatat ctcaggcaaa aatgcaaata acggaacttt aaaactaaat 4320 

aatcatacac ctattaatat cactagtaca cttggtaata ataatgctat agggaccata 4380

gaagtcgcaa ataatgatgt taccattaca gggacattac aagcgcaaaa tattcatttt 4440

tcaaatgcca ctcaggcagc taccttaact ttaggcgcag catcgcaagt gactaatatt 4500

actacggcag gaaataatat tcatacttta gaagtaacag attttgatac tggtaatgac 4560

ggcataatag gagatgcaaa taatagatta aaatcaatag aattg^cagg caatggcaca 462 0 

15 gtgactatta attctccaca tgtttattca tctattacta ctgcaaataa cgcgcaaggt 468 0 

aacgtaaaac tcaatataga aggcggtatt acttatgatt taggaagtaa aataaaaagt 4740 

ttagcaaatg tacaaattag tgaggatact actattagag gtgatgtgta ttctaaatat 48 00 

cttaatatag acgcaggtaa aactataaat tttgatagag gtgataataa catgaatccc 48 60 

aaaaatttag atataccaga tgctctaata gatttagatg tattaccacg ctcgctctca 492 0 

20 ctatttaatt acttcactga tattaaagcg gataatttaa attttgcaga tgatactgct 49 80 

acagcaaatt fctaaagatgc tgttgtaata gatgcacata ttgataatgg tggcatttta 5040 

aaatttaatg acaatgcttg gttaacacaa gaaattaaaa atgctaacat catagaaatt 5100 

gcatctgata aatttatgtt gcttcagaaa aacattaaag cagctacttt aatagctgat 5160 

aatgcaaatt tagtattatt agataatgtg gaagtaaata ctaatctaaa tgttagagat 52 2 0 

25 attgtattag atttagctaa ttatgaatta aaatatactg gtaatcjttac acataatgga 52 80 

ttactgacta ttatcactta ttttgatact gcattacaaa aaggtgggca tatattagtt 53 4 0 

agtcaaggat ctaatgtcga tatgtccgat ttagataatt taataattaa aattaaagca 54 00 

cactccgata tcactaacat tacctcagat actaaacacc aaatagtaaa actcgaaaca 54 60 

ggtgcaatat atactccagt accgcaaact aaagtcatta ttgatgcaag cgaggaacaa 5520 

30 aataaatttg tgaaatgggt cgctgatgca aacggtttgg tattacttac tgatactgga 55 80 

ggtcgagacg atactggagg tcgagacgat actaggggtc gaggcaatac tgacaatggc 5640 

tgtcgtgata attgtgatgt agggaatatc agcaataaca gtagtaatga agcaggtggg 57 00 

tcaagtagcg ataaaaatta tggcatcact gatgttgtac caatttttga tccatctcct 57 60 

atcttagatt atactaaaaa taattatgta gcttctggta tagcaaacca acttattaat 58 2 0 

35 catgttaaag attttggtaa tactactgat gcaggtaaat tattaaatga tttaggtttt 58 80 

atgtctccga atagagttac tgaaacatta gatagactta gtaatagaat aaatgttaat 5940 

ggacttaatg aaggagtagt aggactgaac ggtatcgagg ttgagaattt tttaacagat 60 00 

atagcaataa atatggataa tttcactgct aaagagattg gcaataggtt agaagaatta 60 60 

agcgatgcaa atactgtaaa tggtctcaac aaaacaaaca cattgcttaa taataagatt 6120 

40 aatctaaaaa gactgaatac taataatcag gcaataattg ctgcacjgtga tgaagataat 618 0 

atagtaacgg gcatttgggg catgtcattt tatggtaaaa taaagcaaaa ctctaaaaac 6240 

agtgcaagcg gttatcaatc taatacaggt ggtggtataa taggctttga ttataatatt 63 00 

gataattcta tagttatagg ggcggcttat actatggctg atagtaaagt caagcacaaa 63 60 

aatgataaaa atggtgatag aaccaaagct aaaagtaata tatatfcctat ctatgggctt 642 0 

45 tataattggc ttactaataa cttttttgtt gaagctatag gtgtgfcatgg tagaaataaa 64 80 

atcaaaaatt atgaaaaacg tataactact attactgatc aaatcgcaat aggtaaattt 654 0 

attaatactt tttatagcta tgaattacta ggtggctata actatctaat atcgcatcgt 6600 

accactataa cgccaatgtt tggtatgcgt tatgctacat ttaaaaataa tggttacaaa 6660 

gaaaataata ctactttcca aaatttatct ataaagaaaa attactatga taaatttgaa 672 0 

50 actatattag gtttaaatag tgtaactcat tatttatcac aagatataat aataaagccc 67 80 

gaattacatt ggtttataaa ttatcaatgt aaaaataagt taccaaatat tgatgcacgc 6840 

cttgacggta tagatgaacc attgacaaca attagattta aacctgcaaa gataacatat 69 00 

aatttaggcg gtggtatttc tactaaaaat aatatgatag aatttggtat tagatataac 6960 

ttatctcttg cgaagaaata tacagcacat caaggatcct taaagattaa agtgaacctg 7020 

taa 7023
<212> Type : DNA 
<211> Length : 7023 

SequenceName : SEQ ID 554 
SequenceDescription : 






Sequence 



<213> OrganismName : Rickettsia prowazekii strain Madrid E 
<400> PreSequenceString :
atggctcaaa aaccaaattt tctaaaaaaa ataatttccg caggattggt aactgcttcc 60
acggctacta tagtagctgg tttctctggt gtagcaatgg gtgctgctat gcaatataat 120 
aggacaacaa atgcagcagc tacaaccttt gatggtatag gctttgatca agctgctggt 180 
gctaatattc ctgtcgctcc aaattcagtt attactgcta atgctaataa tcctattacfc 240

tttaatactc caaacggtca tttaaatagt ttatttttgg atactgcaaa fcgatttagca 3 00 

gtaacaatta atgaggatac taccttagga tttataacaa atattgctca gcaggctaag 3 60 

ttctttaatt ttactgttgc tgctggtaaa attcttaaca taacagggca gggtattact 42 0 

5 gttcaagaag cttctaatac aataaatgct caaaatgctc ttacaaaagt gcatggtggc 480 

gctgctatta acgctaatga tcttagcggg ctaggatcaa taacctttgc tgctgcgcct 540 

tctgtattag aatttaattt aataaatcct acaactcaag aagctcctct tacacttggt 600 

gctaattcta aaatagttaa tggtggtaat gggacattaa atattactaa tggatttatt 660 

caggtttcag ataacacttt tgctggtatt aagaccatta atatcgatga ttgtcaaggt 720 

10 ttaatgttta attctactcc tgatgccgct aatactttaa atttacaagt aggtggtaat 780 

actattaatt ttaatggaat agacggtact ggtaaattag tattagtcag taagaatggt 840 

gctgctaccg aatttaatgt tacaggaact ttaggtggta atctaaaagg tattattgaa 9 00 

ttgaacactg cagcagtagc tggtaaactt atctctcaag gaggtgctgc taatgcagta 960 

ataggtacag ataatggagc aggtagagct gcaggattta ttgttagtgt tgataatggt 1020 

15 aatgcagcaa caatttctgg acaagtttat gctaaaaaca tggtgataca aagtgctaat 10 80 

gcaggtggac aagtcacttt tgaacacata gttgatgttg gtttaggcgg taccaccaac 1140 

tttaaaactg cagattctaa agttataata acagaaaact caaactttcjg ttctactaat 12 0 0 

tttggtaatc ttgacacaca gattgtagtc cctgatacta agattcttaa aggtaacttc 1260 

ataggtgatg taaaaaataa cggtaatact gcaggtgtga ttacttttaa tgctaatggt 13 2 0 

20 gctttagtaa gtgctagtac tgatccaaat attgcagtaa caaatattaa tgcaattgaa 13 80 

gcagaagggg ccggggttgt agaattatca ggaatacata ttgcagaatt acgtttaggg 1440 

aatggtggct ctatctttaa acttgctgat ggcacagtaa ttaatggtcc agttaaccaa 150 0 

aatgctctta tgaataataa tgctcttgca gctggttcta ttcagttaga tgggagtgct 1560 

ataattaccg gtgatatagg taacggtggt gttaatgctg cgttacaaca cattacttta 1620 

25 gctaacgatg cttcaaaaat attagcactc gatggcgcaa atattatcgg ggctaatgtt 1680 

ggtggtgcaa ttcattttca agctaacggt ggtactatta aattaacaaa tactcaaaat 1740 

aatattgtag ttaattttga tttagatata actactgata aaacaggtgt tgttgatgca 180 0 

agtagtttaa caaataatca aactttaact attaatggta gtatcggtac tgttgtagct 1860 

aatactaaaa cacttgcaca attaaacatc gggtcaagta aaacaatatt aaatgctggc 192 0 

30 gatgtcgcta ttaacgagtt agttatagaa aataatggtt cagtacaact taatcacaat 1980 

acttacttaa taacaaaaac tatcaatgct gcaaaccaag gtcaaataat cgttgccgcfc 2 040 

gatcctctta atactaatac tactcttgct gatggtacaa atttaggtag tgcagaaaat 2100 

ccactttcta ctattcattt tgccactaaa gctgctaatg ctgactctat afctaaatgta 2160 

ggtaaaggag taaatttata tgctaataat attactacta acgatgctaa tgtaggttct 222 0 

35 ttacacttta ggtctggtgg tacaagtata gtaagtggta cagttggtgg acagcaaggt 22 8 0 

cataagctta ataatttaat attagataat ggtactactg ttaagfctttt aggtgataca 2340 

acatttaatg gtggtactaa aattgaaggt aaatccatct tgcaaatfcag caataattat 2400 

actactgatc atgttgaatc tgctgataat actggtacat tagaatttgt taacactgat 2460 

cctataaccg taacattaaa taaacaaggt gcttattttg gtgttttaaa acaagtaatt 2520 

40 atttctggtc caggtaacat agtatttaat gagataggta atgtaggaat tgtacatggt 2580 

atagcagcta attcaatttc ttttgaaaat gcaagtttag gtacatcttt attcttacct 2640 

agtggtactc cattagatgt tttaacaatt aaaagtaccg taggtaatgg tacagtagat 270 0 

aattttaatg ctcctattgt agttgtatca ggtattgata gtatgatcaa taacggtcaa 2760 

atcatcggtg ataaaaagaa tattatagct ctatcgcttg gaagtgataa cagtattact 2 82 0 

45 gttaatgcta atacattata ttcaggtatc agaactacaa aaaataatca aggtactgtg 2880 

acacttagtg gtggtatgcc taataatcct ggtacaattt atggtttagg tttagagaat 2940 

ggtagtccaa agttaaaaca agtgacattt actacagatt ataacaactt aggtagtatt 3 000 

attgcaaata atgtaacaat taatgattat gtaactctta ctacaggagg tatagcaggg 3 060 

acagattttg acgctaaaat tactcttgga agtgttaacg gtaacgctaa cgtaaggttt 312 0 

50 gttgatagta cattttctga tcctagaagt atgattgttg ctactcaagc taataagggt 3180 

actgtaactt atttaggtaa tgcattagtt agtaatatcg gtagtttaga tactcctgta 324 0 

gcttctgtta gatttacagg taatgatagt ggggcaggat tacaaggcaa tatttattca 33 00 

caaaatatag attttggtac ttataattta actattctaa attctaatcjt cattttaggt 33 60 

ggtggtacta ctgctattaa tggtgaaatc gatcttctga caaataattt aatatttgca 3420 

55 aatggtactt caacatgggg tgataatact tctattagta caacgttaaa tgtatcaagc 34 80 

ggtaatatag gtcaagtagt cattgccgaa gatgctcaag ttaacgcaac aactacagga 3 540 

actacaacca ttaaaataca agataatgct aatgcaaatt tcagtggca'c acaagcttat 3600 

actttaattc aaggtggtgc tagatttaat ggtactttag gagctcctaa ctttgctgta 3 660 

acaggaagta atattttcgt aaaatatgaa ctaatacgtg attctaacca ggattatgta 3720 

60 ttaacacgta ctaacgatgt attaaacgta gttacaacag ctgttggaaa tagtgcaatt 3780 

gcaaatgcac ctggtgtaag tcagaacatt tctagatgct tagaatcaac aaatacagca 3840 

gcttataata atatgctttt agctaaagat ccttctgatg ttgcaacatt tgtaggagct 3900 

attgctacag atacaagtgc ggctgtaact acagtaaact taaatgatac acaaaaaact 3960 

caagatctac ttagtaafcag gctaggtaca cttagatatc taagtaatgc tgaaacttct 4020 

65 gatgttgctg gatctgcaac aggtgcagtg tcttcaggtg atgaagcgga agtatcttat 40 80 

ggtgtatggg ctaaaccttt ctataacatt gcagaacaag acaaaaaagg tggtatagct 4140 

ggttataaag caaaaactac tggggttgta gttggtttag atactctcgc tagcgataac 4200 
ctaatgattg gggcagctat tgggatcact
aaaggtgata aaactgatat taatggttta 
gttaagaatt tctttgctca aggtaatgca 
agtcagcgtt acttcttcga gtctaatggt 
5 tacgataaca tgacatttgg tggtaattta 
aatgtattag taactccaat ggcaggactt 
aaagaaaccg gtacaacagt tgcaaataag 
gatttaatag taggggctaa agtagctggt 
tatccggaaa ttcattcttt tgtggtgcac 
10 tctatgttag atggacaaac tgctccattt 
tcttataata taggcttaag tgcaaacata 
ggttatgatt ttaattctgc aagtaaatat 
gtaaacttct aa 
<212> Type : DNA
<211> Length : 4932

SequenceName : SEQ ID 555 
SequenceDescription : 

Sequence 


<213> OrganismName : Porphyromonas gingivalis W83 
<400> PreSequenceString : 

atggcacgaa ttatcttgga ggctcacgat gtatgggaag acggcacagg ctatcaaatg 60 

ctctgggatg cagatcacaa tcagtacggc gcatccattc ccgaagaatc tttttggttfc 12 0 

25 gccaacggaa cgatcccggc cggtctttac gatcctttcg agtataaagt tccggtcaat 180 

gccgatgcat ctttttctcc cacgaatttc gtgcttgatg gaacagcatc agccgatatfc 240 

cctgccggca cttatgacta tgtaatcatt aaccccaatc ctggcataat atatatagta 300

ggagagggtg tctccaaagg taacgattat gtggtagagg ccggtaagac ttatcatttc 360

actgtccaac gacaaggccc cggcgatgct gcgtccgttg tagtgaccgg agaaggtggc 420

30 aatgaattcg ctcccgtaca gaatctccaa tggtctgtat ccgggcagac agtgaccctc 480 

acttggcaag cccccgcatc cgacaaacgg acttatgtgt tgaacgaaag cttcgatacg 540 

caaacgcttc ctaacggctg gacaatgatc gatgctgatg gtgatggtca caattggcta. 60 0 

tctacaataa acgttfcacaa cactgctact catacaggtg acggtgctat gtttagcaaa. 660 

tcatggacag ctagcagtgg tgcaaaaatt gatttgagtc ctgacaacta tttggtaact 72 0 

35 cctaagttta cggttcctga gaatggtaaa ctttcttatt gggtttcatc tcaagagcct 78 0 

tggactaatg agcattatgg agtgttcttg tccacaaccg gaaacgaggc tgcaaacttt: 840 

acgataaagc tgctggaaga aaccctcgga tccggcaaac ctgctccgat gaacttggtcj 9 00 

aagagtgaag gagtaaaggc tccggcacct tatcaggaaa gaaccatcga tctctctgcc 960 

tatgccggac aacaggtgta cttggcattc cgtcatttcg gctgtacagg tatattccgfc 102 0 

40 ctttatcttg atgacgtggc tgtttctggt gaaggttctt ccaacgacta cacgtacacg 10 8 0 

gtatatcgtg acaatgttgt tatcgcccag aatctcacgg caacgacatt caatcaggaa. 114 0 

aatgtagctc ccggccagta caactactgt gttgaagtta agtacacagc cggcgtatcfc 12 00 

ccgaaggtat gtaaagacgt tacggtagaa ggatccaatg aatttgctcc tgtacagaac 126 0 

ctgaccggta gtgcagtcgg ccagaaagta acgctcaagt gggatgcacc taatggtacc 13 2 0 

45 ccgaatccga atccgggaac aacaacactt tccgaatcat tcgaaaatgg tattcctgco 13 8 0 

tcatggaaga cgatcgatgc agacggtgac ggcaacaatt ggacgacgac ccctcctccc 1440 

ggaggctcct cttttgcagg tcacaacagt gcaatctgtg tctcttcggc ttcttatatc 15 0 0 

aactttgaag gccctcagaa ccctgataac tatctggtta caccggagct ttctcttcct 15 60 

aacggaggaa cgcttacttt ctgggtatgt gcacaagatg ccaattatgc atcagagcac 1620 

50 tatgccgtgt atgcatcttc tacgggtaac gacgcttcca acttcgccaa cgctttgttgj 1680 

gaagaagtgc tgacggccaa gacagttgtt acggcaccgg aagccattcg tggcactcgt 1740 

gttcagggca cctggtatca aaagacggta cagttgcctg cgggtactaa gtatgttgcc 18 0 0 

ttccgtcact tcggctgtac ggacttcttc tggatcaacc tcgatgatgt tgagatcaag 18 60 

gccaacggca agcgcgcaga cttcacggaa acgttcgagt cttctactca tggagaggca. 192 0 

55 ccagcggaat ggactactat cgatgccgat ggcgatggtc agggttggct ctgtctgtcfc 19 80 

tccggacaat tgggatggct gacagctcat ggcggcacca acgtagtagc ctctttctca 2 04 0 

tggaatggaa tggctttgaa tcctgataac tatctcatct caaaggatgt tacaggcgca. 2100 

acgaaggtaa agtactacta tgcagtcaac gacggttttc ccggggatca ctatgcggtg 2160 

atgatctcca agacgggcac gaacgccgga gacttcacgg tcgttttcga agaaacgcct 222 0 

60 aacggaataa ataagggcgg agcaagattc ggtctttcca cggaagccaa tggcgccaaa. 2280 

cctcaaagtg tatggatcga gcgtacggta gatttgcctg cgggcacgaa gtatgttgct 2340 

ttccgtcact acaattgctc ggatttgaac tacattcttt tggatgatat tcagttcacc 24 00 

atgggtggca gccccacccc gaccgattat acctacacgg tgtatcgtga tggtacgaag 24 60 

atcaaggaag gtttgaccga aacgaccttc gaagaagacg gcgtagctac gggcaatcat 2520 

65 gagtattgcg tggaagtgaa gtacacagcc ggcgtatctc cgaaagagtg tgtaaacgta 2580 

actgttgatc ctgtgcagtt taatcctgta cagaacctga ccggtagtgc agtaggtcag 2640 

aaagtaacgc ttaagtggga tgcacctaat ggtaccccga atccgaatcc cggaacaact 270 0 



aaaactgata taaaacacca 
tcattctctc tatatggttc 
atctttacct taaacaaagt 
aagatgagca agcaaattgc 
atatttggtt atgattataa 
agctacttaa aatcttctaa 
cgcattaata gcaaatttag 
agtactgtga atataactga 
aaagtaaatg gtaaattatc 
atcagtcaac ctgatagaac 
aaatctgatg ctaagatgga 
actgcacatc aaggtacttt 



agattataag 42 60 

ccaacagctt 4320 

caaaagtaaa 43 8 0 

tSCtggtaat 4440 

tgcaatgcca 45 00 

tgaaaattat 45 60 

tgatagagtc 462 0 

tattgtgata 4680

taactctcag 4740 

tgctaaaacg 48 00 

gtatggtatc 48 60 

aaaagtacgt 4920 

4932 
acactttccg aatcattcga aaatggtatt
ggtgacggca acaattggac gacgacccct 
aacagtgcga tctgtgtctc ttcggcttct 
gataactatc tggttacacc ggagctatct 
5 gtatgtgcac aagatgccaa ttatgcatca 
ggtaacgacg cttccaactt cgccaacgct 
gttgttacgg cacctgaagc cattcgtggc 
acggtacagt tgcctgcggg tactaagtat 
ttcttctgga tcaacctcga tgatgttgag 
10 acggaaacgt tcgagtcttc tactcatgga 
gccgatggcg atggtcaggg ttggctctgt 
gctcatggcg gcaccaacgt agtagcctct 
gataactatc tcatctcaaa ggatgttaca 
gtcaacgacg gttttcccgg ggatcactat
15 gccggagact tcacggttgt tttcgaagaa 
agattcggtc tttccacgga agccaatggc 
acggtagatt tgcctgcggg cacgaagtat 
ttgaactaca ttcttttgga tgatattcag 
gattatacct acacggtgta tcgtgatggt 

20 accttcgaag aagacggcgt agctacgggc 
acagccggcg tatctccgaa agagtgcgta 
cctgtacaga acctgaccgg tagtgcagtc 
cctaatggta ccccgaatcc gaatccggga 
ggtattcctg cctcatggaa gacgatcgat 

25 acccctcctc ccggaggcac ctcttttgca 
gcttcttata tcaactttga aggccctcag 
ctttctcttc ctaacggagg aacgcttact 
gcatcagagc actatgccgt gtatgcatct 
aacgctttgt tggaagaagt gctgacggcc 

30 cgtggtactc gtgttcaggg cacctggtat 
aagtatgttg ccttccgtca cttcggctgt 
gttgagatca aggccaacgg caagcgcgca 
catggagagg caccagcgga atggactact 
ctctgtctgt cttccggaca attgggatgg 

35 gcctctttct catggaatgg aatggctttg 
gttacaggcg caacgaaggt aaagtactac 
cactatgcgg tgatgatctc caagacgggc 
gaagaaacgc ctaacggaat aaataagggc 
aatggcgcca aacctcaaag tgtatggatc 

40 aagtatgttg ctttccgtca ctacaattgc 
attcagttca ccatgggtgg cagccccacc 
gatggtacga agatcaagga aggtttgacc 
acgggcaatc atgagtattg cgtggaagtg 
tgcgtaaacg taactattaa tccgacacag 

45 caagctccta acagcatgga tgcaatcctt 
gaagttctga acgaagactt cgaaaatggt 
gacggtgacg gcaacaattg gacgacgacc 
cacaacagtg cgatctgtgt ctcttcggct 
cctgataact atctggttac accggagctt 

50 tgggtatgtg cacaagatgc caattatgca 
acgggtaacg acgcttccaa cttcgccaac 
acagttgtta cggcaccgga agccattcgt 
aagacggtac agttgcctgc gggtactaag 
gacttcttct ggatcaacct tgatgatgtt 

55 acctatacga tctatcgtaa taatacacag 
cgagatccgg acttggctac cggtttttac 
ggagaatcag ctatcgaaac tgctacgttg 
cagaagcctt acacgctgac agttgtagga 
gctatgatct acgacatgaa cggtcgtcgt 

60 acggctcagg gcggccacta tgcagtcatg 
aaactcgctg taaagtaa 
<212> Type : DNA 
<211> Length : 6318 

SequenceName : SEQ ID 556 

SequenceDescription :



cctgcctcat ggaagacgat cgafcgcagac 2760 

cctcccggag gcacctcttt tgcaggtcac 2820 

tatatcaact ttgaaggccc tcagaaccct 28 80 

cttcctaacg gaggaacgct tactttctgg 2940 

gagcactatg ccgtgtatgc atcttctacg 30 00 

ttgttggaag aagtgctgac ggccaagaca 3 0 60 

actcgtgttc agggcacctg gtatcaaaag 3120 

gttgccttcc gtcacttcgg ctgtacggac 3180 

atcaaggcca acggcaagcg cgcagacttc 3240 

gaggcaccag cggaatggac tactatcgat 33 00 

ctgtcttccg gacaattgga ctggctgaca 33 60 

ttctcatgga atggaatggc tttgaatcct 342 0 

ggcgcaacga aggtaaagta ctactatgca 3480 

gcggtgatga tctccaagac gggcacgaac 3540 

acgcctaacg gaataaataa gggcggagca 36 00 

gccaaacctc aaagtgtatg gatcgagcgt 3 660 

gttgctttcc gtcactacaa ttgctcggat 3720 

ttcaccatgg gtggcagccc caccccgacc 3780 

acgaagatca aggaaggttt gaccgaaacg 3840 

aatcatgagt attgcgtgga agtgaagtac 39 00 

aacgtaactg ttgatcctgt gcagttcaat 3960 

ggccagaaag taacgctcaa gtgcfgatgca 402 0 

acaacaacac tttccgaatc attcgaaaat 4080 

gcagacggtg acggcaacaa ttggacgacg 414 0 

ggtcacaaca gtgcgatctg tgtctcttcg 42 00 

aaccctgata actatctggt tacaccggag 4260 

ttctgggtat gtgcacaaga tgccaattat 43 20 

tctacgggta acgacgcttc caacttcgcc 43 80 

aagacagttg ttacggcacc ggaagccatt 4440 

caaaagacgg tacagttgcc tgcgggtact 45 00 

acggacttct tctggatcaa cctcgatgat 4560 

gacttcacgg aaacgttcga gtcttctact 4620 

atcgatgccg atggcgatgg tcagggttgg 4680 

ctgacagctc atggcggcac caacgtagta 4740 

aatcctgata actatctcat ctcaaaggat 48 00 

tatgcagtca acgacggttt tcccggggat 4860 

acgaacgccg gagacttcac ggtcgttttc 492 0 

ggagcaagat tcggtctttc cacggaagcc 4980 

gagcgtacgg tagatttgcc tgcgggcacg 5040 

tcggatttga actacattct tttggatgat 5100 

ccgaccgatt atacctacac ggtgtatcgt 5160 

gaaacgacct tcgaagaaga cggcgtagct 522 0 

aagtacacag ccggcgtatc tccgaaagag 52 80 

ttcaatcctg tacagaacct gacggcagaa 534 0 

aaatggaatg caccggcatc taagcgtgcg 540 0 

attcctgcct catggaagac gatcgatgca 5460 

cctcctcccg gaggctcctc ttttgcaggt 5520 

tcttatatca actttgaagg tcctcagaac 558 0 
tctcttcctg gcggaggaac gcttactttc 5640

tcagagcact atgccgtgta tgcatcttct 57 00 

gctttgttgg aagaagtgct gacggccaag 5760 

ggtactcgtg ttcagggcac cfcggtatcaa 582 0 

tatgttgcct tccgtcactt cggctgtacg 58 8 0 

gtaatcactt cagggaacgc tccgtcttac 5940 

atagcatcag gcgtaacgga gactacttac 60 00 

acgtacggtg ttaaggttgt ttacccgaac 6060 

aatatcactt cgttggcaga cgtaacggct 612 0 

aagacgatca cggtaacttg ccaaggcgaa 618 0 

ctggcagcgg gtcgcaacac ggtfcgtttac 6240 

gttgtcgttg acggcaagtc ctacgtagag 63 0 0 

6318 



Sequence 
<213> OrganismName : Porphyromonas gingivalis W83
<400> PreSequenceString : 

atgaaaacat ctgaaagaat attaagttat ttctfccctct tatgtgctgt afctcagtctg 60 

5 ggctcatgcg aaggacttta tgcacaggta actttcccaa attattcgcc tacggcggct 12 0 

tcgtccattg ctgtatgttc tggagaagag acattgatca ttgactttac tgtagtccag 180 

gaggattcga atggtatcaa agttaatgtg aaacttgccg acggtgtcga gtatgtggtc 240 

ggaacggctg tcgtaagtgt tacacagggc aatgcagtga cggtggcgga aaccaatgtt 3 00 

tctaatccga acgaacctgt atttacggta aaatcrggcgg atggaaacaa fcgtggtagag 3 60 

10 cttggaacca tcgttaagct gacgattaag agga^agctg tctgtaccgc atggagcaat 420 

gccattaatg ctgccgaaac gggttttgtc ttcaaagaca aggtaacggt gactatcggc 48 0 

gatcatagtg atagcaagga atcaaactcc tattcggtaa actatccgaa cctgacgatc 540 

aaacagcctg cgccgcaagt gaacaagcag attggggaga ccatcgtacg agagttttct 60 0 

ataaccaatg gttctcagaa cccgacccag acagtttatc tttcgataga-gtatcccgat 660 

15 gaagcctacc tcacaggggt gggggcgatg acgcfctcagg ctaaactggg tgcttccggc 720 

acctatgccg atcttactcc taccgtcact aacggtaagg tccgcatcta tacactttcg 780 

ggttccagtt tggggcctga tcatctcttg accaacggcg agatcatcta tctgaaagaa 840 

acatttaagc tgaaaacttg tgcaccggtt acggtzzctata gggtaggttg gggttgtagc 900 

atagatagcc agtgtgagat aaaaactacc gctgcrtacaa ttactatggc agctggtgcg 960 

20 gctaatatca cgggatattc agttactggt ccfcgeattatc gttctccaac tttttctctt 1020 

tgccaaccgt ttgagttgac tattaagttc tcaaattccg gtgctggtgg ctctatgggg 10 80 

gcagcattca atatcaatac tattggtaga aacgactatt atagaccaag agggtttgtt 1140 

ttacacgaat ttattgatgt caaagtaaac ggfca^.gccgg taacgaattt caaaaccgat 12 0 0 

ggctcagagc tcgaccttcg ttttgatgga cagtfctacag aagatcctga cggaccgggg 1260 

25 gttggtttgg atgatgttga cggtgacgga ttttstgatg atcttcctgt cggagctact 13 2 0 

attacgatta ctgtaacggt gcggctaaag tgtgaitcagt ttacggcatg caacaacgct 13 80 

ccaaatgatt tgtccgatag gggcttgatt cttaaaacac tatatcagac atcttgtgat 1440 

agaacctcat ggatagatcc caacacgtgg ttcaatcttt ctagtactca tttgtatttg 1500 

tctcgtgagt cggtacaaga tgcctctcac atgcctactg taatagagaa ggatacgcct 1560 

30 ttcgacctga agataatgac ttcctactat tccatcctca gctcatataa taatatatgg 1620 

tacgccaatc ccaatacgcg gtatgtggta gaaatragtat tcccgcaagg tatgactatg 1680 

cctcccaaat cggatataga atggaccaat ataaaaaatc atccgataga cgggtcctta 1740 

gttttcactc caccgattaa cctccctgat gcaaatatca cgacatcagg aaacacaatg 1800 

actatcgttt ^cgcccagcca agaaagaggt tttgtaaccc tccatggtgt gaaatacgac 18 60 

35 tgtacaaata atcacgaaat ggttgtggag tataagatta gagaggtatt caactacctt 1920 

cacttccctg attgtctttg cccggtaggt cctatitatgt gtaacacggc aaagcgttat 1980 

gttttgggct gcgatcctcc ctgcggtaga ggtgcggaaa ctfccggtgcc taagatagaa 2040 

cgtgccgaca attcgttggg ctggacggat tatacrgatgc ggacccgtca atctcgtagc 2100 

aatatatcgg cttacgactt ggctaaagcc ctatatatgg acgaagtcaa cattacagcg 2160 

40 acttctatcc agcatggtac tgcttcgtct ttgg^cgccc gttttgtttt ggctacgggt 222 0 

gtcgatcgag tagaaacgct tactcctctt tcggc:cgata ttaagatctt ccgtgatggg 2280 

gttcagattg tatcggtaga tggatatacg acatfcccgtt caatacgccg aaataataat 23 40 

gcagagcagg tgatcgactg ggattttact tcaafcccttc ctgctggtgg attgcttgat 2400 

agagacaaag tggatgttgt tacccgttat cgggtiaacat cccagaatgc tcatagagta 2460 

45 gatacgcaag tcggtaggga gtggttcttc tacaactcta ccgccaatgt gtcaccgata 2520 

tgggatgaag ccaatccgtt aacttgtctc atacfctgtgc ccgagatata catcatgggt 258 0 

acttttgttg taaacggtac cgatccacat gtcatittcac aatgtactcc aacagatctg 2640 

ggacgcgttg ctaaccacta cgcccgtcgt ttcg^ctctg gtgcatttga atatgccaat 2700 

gaatatcgtc ctggtgtaaa gattagaaat atctatctga aagtaccgaa gtcctacacg 2760 

50 ctgaataggg tggagtatag caatcaccgt aaccatagtt cgttaggtac aaccatgcct 2 820 

ttcgaggaaa taaatcatac agatgtgact tcacsigggtg aatataacat ctataagtat 2880 

caacttgcag acaacgaaaa ggcgcacttc aatafctacag taaaaaatgc ctatggagca 2940 

gctcttaaag taaatgtatc tcccacttgt gcgtczrgtctg ctgtagcaac taattatgat 3 0 00 

aaaatttcat actatgtcga ttacattgac tattactatt atgcagcaac gcagccaaca 3 060 

55 gtacctaata gccttgacat agtagccgat caatcrggctg gcagcaacgg aatctacagt 3120 

gtttccgccc tcaatgttta caacaggcct atcctttata ctaacaaacc ttctattgcg 318 0 

ctcgtcaatc agtcaggtga ggtagagctt gtgggtaaaa cgggagagtg gaagctgcgt 3240 

atcagcaacc catcgagtgc aacggctccc tatgtttggt tggcattgcc tacaacatcg 3 3 00 

gggctgacca tcgaaaaagt aactgatgcg gcagcjtactg aaatggcgtt tacaacttat 33 60 

60 tctggtggca agatgtatcg tttgtcggaa gctggtgttc cagtaggttc tgcgcttgac 3420 

tataccattc actttaccta ttctggttgc tctccntatcg ctttgaaggc gatggggggc 3480 

tggaactgta gtgcatatcc ccttagtttg gatgeigtatg tttgcagttc gcaggtgatc 3540 

gatctcaagc tcaagccact gccagctgcc atggsgctta ctgagatagc tgttccagat 3600 

cctacagctg ctgctacatt gtgtagtaca ttggsiatata tttacagcat tcaatcgaca 3660 

65 gataatgcga acgtttatag tcctactttc agcafccttcc ccgaagaggg attggtggtg 3720 

acaccgaatc aggtacaggt ggaatatcct gccggttccg gtaattgggc tgcactcaat 3780 

gtggtcaata attccgtcaa tctgttgcag catccz:tgcat tgactaccat aggctatctc 3 840 
aagggcctga aagaggggga atccaacgac aatcaacgta aaattttggt gaagttctac 3900

ataaagaccg agtgttcgtt cgtatctggc aagaacttcc gtgtaagagc tgacggccgt 3960 

aatgcttgta atcagaatgc caagggatcg ggtcttgcca taagtacgcc tccaattaga 402 0 

ataaatggag ctatagagcc ctacacgact tctgcttcta cgcagcttgt tacgaccaca 4080 

5 acatcacaat cggactgtaa agctcccaaa agagtaaaag tggfcgcaaac ggtcgtaggt 4140 

ggagaaacta cccccaaggc atatttggaa atcacgctgc cgttgggctt taagtatgtg 4200 

acgggttctt atgctccgga caatacgcat ccgggaggag tcaatgcctc acctgccgga 4260 

acggaagaag tcactttgac cgcgaacggt gaagacaaga ttaagataaa tgtcaaggcc 4320 

ggtctgacgt caggtcaatc gtttgcttat acactcgaaa tgaaggaaga cgatgataat 43 80 

10 gtgccggctt gtgtjcaatca taccatcgaa attgtcaatg ttgaggagat tgaaggtttg 4440 

tggtgtgaag gcgttcagtg tgcagaaact ttggtcgtca cgggtgccaa caagtttgaa 450 0 

tttgagcttg ataagcctta cttggatatt acggttattt cagcagtatc gactttcagt 4560 

ggtggtaagg aaaatcttac aattgagtat aaggtaagca atacatcgac cacccagcct 4620 

ctgaaaccgg gagcggttgt aacgctgttc agcgataagg ataacaatca agtcttctcc 4680 

15 ggcggagatg ttgctgttgc aacacaggag ttggtcgcag aaataactaa taccacacct 4740 

cttacgcaga taatgaaggt aaaaggagtg agctcttccc atacgggcaa tttggttctt 4800 

acgatactgc ccaaagacgg ttgctactgt gagatcaaat cccctatggt cacgttaaac 48 60 

catcttcctt cgaattactg gatcggagga actgtaggta agcctaacga atggaaagag 4920 

ccgaacaact ggaccaatga ccaagttccc gatgcggcag aggatgttga attcgccact 4980 

20 gaggtgaata acccgactga tccgaataat ccgaagtcgg gtcctgcgaa ggagaacctg 5040 

catctggacg atatacacca gaatggcaca gccggtcgcg ttatcggcaa cctgatcaat 510 0 

gactctgaca aagatctggt aatcacaacg ggcaatcaat tgacgatcaa cggcgtggtt 5160 

gaggataaca atccgaatgt cggtacgatc gtcgtgaagt cgtcgaaaga caatcctacg 5220 

gggacgttgc ttttcgccaa tccgggcaat aatcaaaatg taggggggac tgtcgagttt 5280 

25 tacaatcagg gatatgattg tgccgattgt ggtatgtatc gcaggagctg gcagtatttc 5340 

ggtatccctg tcaatgaatc agattttcca tatgatcatg ttgatggaaa cgcgaccgtc 5400 

aaccaatggg ttgagccttt caatggcgat aagtggcggc ctgcacctta tgcacctgat 5460 

acaaagcttc agaaattcaa gggctatcag atcacgaatg acgtgcaggc acagcctacg 5520 

ggagtttaca gcttcaaggg tacgctttgt gtgtgcgatg ccttcctgaa cctgacacgc 558 0 

30 acgtccggtg tcaactactc gggcgccaac ttgatcggca actcatacac tggagctatc 5640 

gacatcaagc aaggtattgt cttcccgccg gaagtcgagc agacggtgta tctgttcaac 57 0 0 

acgggaacac gcgaccagtg gcgtaagctt aatggaagca cggtttcagg ctatcgagcc 5760 

ggtcagtacc tctctgtacc taagaataca gcgggtcagg acaatcttcc ggatcgtatt. 582 0 

ccatcgatgc attccttctt ggtgaagatg cagaacggag cctcttgtac gttgcagatc 5880 

35 ttgtacgafca agctgctcaa gaacacgact gtaaacaacg gtaatggtac gcagatcaca 5940 

tggcgatccg gcaactccgg atcggcgaat atgccgtcac ttgtgatgga tgttcttggt 60 00 

aatgagtcgg ccgaccgttt gtggatcttt accgatgggg gtctttcgtt cggattcgac 6060 

aacggctggg atggtcgcaa gctgactgaa aaaggtttgt cacaacttta tgcgatgtct 612 0 

gacatcggta atgafcaaatt ccaggttgca ggggttccgg agttgaataa cctgctgatc 6180 

40 ggcttcgatg cggataagga tggtcaatac acgttggagt ttgctctttc ggatcatttt 6240 

gcgaaaggcg gagttttcct tgaggatctt agccgtggag ttacacggcg agtagtcgac 63 00 

ggcggttcgt attcattcga tgccaagcga ggagactccg gggctcgttt ccgtctctct 6360 

tatgacgaag agtgggttga atcggcagag gtttctgttt tggttggtac ggccgggaag 6420 

cgaatcgtta tcacgaataa cagtgagcat gcctgtcagg ccaatgttta tacaactgac 64 80 

45 ggaaaacttc tgattcgatt ggacgtaaaa cccggaagta agtctatgac ggaaccattg 6540 

gtcgatggag tctacgttgt cagtctgcaa agtcctgcca cgagtagcaa tgtaaggaaa 6600 

gttgtagtca actaa 6615 
<212> Type : DNA 
<211> Length : 6615 

SequenceName : SEQ ID 557

SequenceDescription : 

Sequence 



<213> OrganismName : Porphyromonas gingivalis W83

<400> PreSequenceString : 

atgaacaaat tttacaaatc actttfcgcag 

actgcactga ccgcttctgc acagatttcg 

agatccgccg gaacgcattc attcgacgat 
60 aatccggaag acctgatcgc acagagccgt 

cggataggac aagtaatacc ggtggatgtg 

tccatcggag acgtagatgt atatcgcctg 

acgctttatt acgatgcatt caatattccg 

gaccatgaaa ttgtgttggg agcatatacg 
65 gccacagagc cggtaccggg gagtgagctt 

actttgcctg acatcaagat ctccggtgcg 

cccgtaacgg ataaccatta cgggatcggt 



tcaggactgg ctgccttcgt gtcgatggca 60 

ttcggagggg aacccttgag tttctcttca 120 

gcaatgacta tccgccttac tccggatttc 180 

tggcaatcgc aaagagatgg ccggcccgtc 240 

gactttgcat ccaaggcttc gcacatctct 3 00 

caattcaagt tggaaggagc caaagccatt 3 60 

gagggcggac gcctctatat ctataccccc 420 

aacgccactc afccgccgcaa cggagctttt 480 

attatggatt atgaagtgtc tcgcggaggg 540 

ggttatatat tcgacaaagt cggcggacgc 600 

gaggacgatt ccgattcgga ttgcgagatc 660 
aacatcaatt gtcctgaagg tgcagactgg caggcagaga agaacggtgt ggtgcaaatg 720

atcatggtaa aaggacagta tatctcaatg tgctcaggca acctgctcaa taatacgaaa 780 

ggagacttta ctccgctgat catttctgcc ggacactgtg cttccataac aaccaatttc 840 

ggtgtaacgc aatccgagtt ggataagtgg atcttcactt tccactatga aaaaagagga 900 

5 tgcagcaatg gtacattggc catcttccgt ggcaacagta tcatcggagc ttccatgaag 960 

gctttcctcc cgatcaaagg taaatccgat ggtctcttgc tgcaactcaa cgatgaagtc 1020 

cctctgcgct atcgtgtcta ttacaatgga tgggacagta cgcccgatat tccctcgagc 10 8 0 

ggtgccggta ttcatcatcc ggccggagat gccatgaaga tttccatcct aaagaagact 1140 

ccggctctga atacatggat ctcctccagt ggttccggag ggactgacga tcacttctat 120 0 

10 ttcaaatacg atcaaggtgg tacggaagga ggatcgtccg gttcttctct cttcaatcag 1260 

aataagcacg tggtcggcac actgaccgga ggtgccggca attgtggcgg gacggagttc 1320 

tacggcagac tgaacagtca ttggaacgag tatgcatccg atggcaatac gagccgcatg 13 80 

gacatctatc tggatcccca aaacaatggc cagacgacca tcctcaacgg aacgtatcgt 144 0 

gacggttata agcctttgcc ctctgtgccc cggctattgt tgcagtctac aggcgatcag 1500 

\5 gtcgaattga attggacggc tgttcctgcc gatcaatatc catcatctta tcaggtcgaa 1560 

taccacatat tccgaaatgg aaaggaaata gctacgacaa aggagttgtc ctattcggat 1620 

gccatcgacg aaagtattat cggtagcggt atcattcgat acgaagtaag cgcacgcttc 168 0 

atttatccct cgccgttgga tggagtggaa tcttataagg atacggacaa gacttctgcc 1740 

gaccttgcca taggagacat tcagaccaag ctgaagccgg acgtaacacc tctccccgga 180 0 

20 ggaggagtat cattaagctg gaaagttcct ttcttaagcc agttggtttc ccgattcgga 1860 

gaaagcccca atcctgtgtt caaaaccttt gaagtgccct atgtttctgc cgcagccgca 1920 

caaaccccca atcctcccgt tggcgtagtc attgcagaca agtttatggc cggtacatat 198 0 

cccgaaaagg ctgctatcgc tgccgtttat gtaatgccat ccgctccgga ctctactttc 204 0 

cacctcttcc tcaagagcaa cacaaacaga agattgcaga aggtgacaac tccctccgat 210 0 

25 tggcaggccg gaacatggtt gaggatcaat ttggataagc cgttcccggt gaataatgac 2160 

catatgcttt ttgccggtat cagaatgcct aataagtaca agctcaatcg tgctatccgt 2220 

tatgtaagaa atccggataa ccttttctcc attaccggta agaagatttc atataacaac 22 80 

ggagtctctt tcgaaggcta cggaataccc tcgctcttgg gctatatggc tatcaaatat 234 0 

ctggtggtaa ataccgatgc tccgaagatc gatatgtcgc ttgtacagga gccttatgct 240 0 

30 aagggaacga atgtggctcc attccccgaa ttggtcggca tatatgtcta taagaacgga 2460 

acatttatcg gcacacagga tccatccgtc acaacttatt cggtttcaga cggaacagag 2520 

agcgatgaat acgaaataaa actggtatat aagggatcgg gcatttcgaa tggcgttgct 2580 

cagattgaga ataacaatgc tgtcgttgca tatccgtctg ttgtaacaga tcgtttcagc 2640 
attaagaacg ctcatatggt tcacgctgcc gccctctact cattggatgg caagcaggtt .. 27 00 

35 cgttcttgga acaacctccg caatggcgtg acattcagtg ttcaaggact tacggccggt 2760 

acttatatgc tcgttatgca gacggcaaac ggccctgtga gccaaaagat cgtgaagcag 2820 

tag 2823 
<212> Type : DNA 
<211> Length : 2823

SequenceName : SEQ ID 558

SequenceDescription : 

Sequence 



<213> OrganismName : Porphyromonas gingivalis W83
<400> PreSequenceString :

atgaaaaact tgaacaagtt tgtttcgatt gctctttgct cttccttatt aggaggaatg 60
gcatttgcgc agcagacaga gttgggacgc aatccgaatg tgagattgct cgaatccact 120
cagcaatcgg tgacaaaggt tcagttccgt atggacaacc tcaagttcac cgaagttcaa 180
acccctaagg gaatggcaca agtgccgacc tatacagaag gggttaatct ttctgaaaaa 240
gggatgccta cgcttcccat tctatcacgc tctttggcgg tttcagacac tcgtgagatg 300
aaggtagagg ttgtttcctc aaagttcatc gaaaagaaaa atgtcctgat tgcaccctcc 360
aagggcatga ttatgcgtaa cgaagatccg aaaaagatcc cttacgttta tggaaagagc 420
tactcgcaaa acaaattctt cccgggagag atcgccacgc ttgatgatcc ttttatcctt 480
cgtgatgtgc gtggacaggt tgtaaacttt gcgcctttgc agtataaccc tgtgacaaag 540
acgttgcgca tctatacgga aatcactgtg gcagtgagcg aaacttcgga gcaaggcaaa 600
aatattctga acaagaaagg tacatttgcc ggctttgaag acacatacaa gcgcatgttc 660
atgaactacg agccagggcg ttacacaccg gtagaggaaa aacaaaatgg tcgtatgatc 720
gtcatcgtag ccaaaaagta tgagggagat attaaagatt tcgttgattg gaaaaaccaa 780
cgcggtctcc gtaccgaggt gaaagtggca gaagatattg cttctcccgt tacagctaat 840
gctattcagc aattcgttaa gcaagaatac gagaaagaag gtaatgattt gacctatgtt 900
cttttgattg gcgatcacaa agatattcct gccaaaatta ctccggggat caaatccgac 960
caggtatatg gacaaatagt aggtaatgac cactacaacg aagtcttcat cggtcgtttc 1020
tcatgtgaga gcaaagagga tctgaagaca caaatcgatc ggactattca ctatgagcgc 1080
aatataacca cggaagacaa atggctcggt caggctcttt gtattgcttc ggctgaagga 1140
ggcccatccg cagacaatgg tgaaagtgat atccagcatg agaatgtaat cgccaatctg 1200
cttacccagt atggttatac caagattatc aaatgttatg atccgggagt aactcctaaa 1260



WO 2005/076010 



237/341 



PCT/IN2005/000037 



aacattattg atgctttcaa cggaggaatc tcgttggcca actatacggg ccacggtagc 1320
gaaacagctt ggggtacgtc tcacttcggc accactcatg tgaagcagct taccaacagc 1380
aaccagctac cgtttatttt cgacgtagct tgtgtgaatg gcgatttcct attcagcatg 1440
ccttgtttcg cagaagcatt gatgcgtgca caaaaagatg gtaagccgac aggtactgtt 1500
gctatcatag cgtctacgat caaccagtct tgggcttctc ctatgcgcgg gcaggatgag 1560
atgaacgaaa ttctgtgcga aaaacacccg aacaacatca agcgtacttt cggtggtgtc 1620
accatgaacg gtatgtttgc tatggtggaa aagtataaaa aggatggtga gaagatgctc 1680
gacacatgga ctgtattcgg cgacccctcg ctgctcgttc gtacacttgt cccgaccaaa 1740
atgcaggtta cggctccggc tcagattaat ttgacggatg cttcagtcaa cgtatcttgc 1800
gattataatg gtgctattgc taccatttca gccaatggaa agatgttcgg ttctgcagtt 1860
gtcgaaaatg gaacagctac aatcaatctg acaggtctga caaatgaaag cacgcttacc 1920
cttacagtag ttggttacaa caaagagacg gttattaaga ccatcaacac taatggtgag 1980
cctaacccct accagcctgt ttccaacttg actgctacaa cgcagggtca gaaagtaacg 2040
gtcaagtggg atgcaccgag cacgaaaacc aatgcaacca ctaataccgc tcgcagcgtg 2100
gatggcatac gagaactggt tcttctgtca gtcagcgatg cccccgaact tcttcgcagc 2160
ggtcaggccg agattgttct tgaagctcac gatgtttgga atgatggatc cggttatcag 2220
attcttttgg atgcagacca tgatcaatat ggacaggtta tacccagtga tacccatact 2280
ctttggccga actgtagtgt cccggccaat ctgttcgctc cgttcgaata tacggttccg 2340
gaaaatgcag atccttcttg ttcccctacc aatatgataa tggatggtac tgcatccgtt 2400
aatataccgg ccggaactta tgactttgca attgctgctc ctcaagcaaa tgcaaagatt 2460
tggattgccg gacaaggacc gacgaaagaa gatgattatg tatttgaagc cggtaaaaaa 2520
taccatttcc ttatgaagaa gatgggtagc ggtgatggaa ctgaattgac tataagcgaa 2580
ggtggtggaa gcgattacac ctatactgtc tatcgtgacg gcacgaagat caaggaaggt 2640
ctgacggcta cgacattcga agaagacggt gtagctgcag gcaatcatga gtattgcgtg 2700
gaagttaagt acacagccgg cgtatctccg aaggtatgta aagacgttac ggtagaagga 2760
tccaatgaat ttgctcctgt acagaacctg accggtagtg cagtcggcca gaaagtaacg 2820
cttaagtggg atgcacctaa tggtaccccg aatccaaatc caaatccgaa tccaaatccg 2880
aatcccggaa caactacact ttccgaatca ttcgaaaatg gtattcctgc ctcatggaag 2940
acgatcgatg cagacggtga cgggcatggc tggaagcctg gaaatgctcc cggaatcgct 3000
ggctacaata gcaatggttg tgtatattca gagtcattcg gtcttggtgg tataggagtt 3060
cttacccctg acaactatct gataacaccg gcattggatt tgcctaacgg aggtaagttg 3120
actttctggg tatgcgcaca ggatgctaat tatgcatccg agcactatgc ggtgtatgca 3180
tcttcgaccg gtaacgatgc atccaacttc acgaatgctt tgttggaaga gacgattacg 3240

gcaaaaggtg ttcgctcgcc ggaagctatt cgtggtcgta tacagggtac ttggcgccag 3300
aagacggtag accttcccgc aggtacgaaa tatgttgctt tccgtcactt ccaaagcacg 3360
gatatgttct acatcgacct tgatgaggtt gagatcaagg ccaatggcaa gcgcgcagac 3420
ttcacggaaa cgttcgagtc ttctactcat ggagaggcac cagcggaatg gactactatc 3480
gatgccgatg gcgatggtca gggttggctc tgtctgtctt ccggacaatt ggactggctg 3540
acagctcatg gcggcaccaa cgtagtaagc tctttctcat ggaatggaat ggctttgaat 3600
cctgataact atctcatctc aaaggatgtt acaggcgcaa cgaaggtaaa gtactactat 3660
gcagtcaacg acggttttcc cggggatcac tatgcggtga tgatctccaa gacgggcacg 3720
aacgccggag acttcacggt tgttttcgaa gaaacgccta acggaataaa taagggcgga 3780
gcaagattcg gtctttccac ggaagccgat ggcgccaaac ctcaaagtgt atggatcgag 3840
cgtacggtag atttgcctgc gggcacgaag tatgttgctt tccgtcacta caattgctcg 3900
gatttgaact acattctttt ggatgatatt cagttcacca tgggtggcag ccccaccccg 3960
accgattata cctacacggt gtatcgtgat ggtacgaaga tcaaggaagg tttgaccgaa 4020
acgaccttcg aagaagacgg cgtagctacg ggcaatcatg agtattgcgt ggaagtgaag 4080
tacacagccg gcgtatctcc gaagaaatgt gtaaacgtaa ctgttaattc gacacagttc 4140
aatcctgtaa agaacctgaa ggcacaaccg gatggcggcg acgtggttct caagtgggaa 4200
gccccgagcg caaagaagac agaaggttct cgtgaagtaa aacggatcgg agacggtctt 4260
ttcgttacga tcgaacctgc aaacgatgta cgtgccaacg aagccaaggt tgtgctcgca 4320
gcagacaacg tatggggaga caatacgggt taccagttct tgttggatgc cgatcacaat 4380
acattcggaa gtgtcattcc ggcaaccggt cctctcttta ccggaacagc ttcttccgat 4440
ctttacagtg cgaacttcga gtatttgatc ccggccaatg ccgatcctgt tgttactaca 4500
cagaatatta tcgttacagg acagggtgaa gttgtaatcc ccggtggtgt ttacgactat 4560
tgcattacga acccggaacc tgcatccgga aagatgtgga tcgcaggaga tggaggcaac 4620
cagcctgcac gttatgacga tttcacattc gaagcaggca agaagtacac cttcacgatg 4680
cgtcgcgccg gaatgggaga tggaactgat atggaagtcg aagacgattc acctgcaagc 4740
tatacctata cagtctatcg tgacggcacg aagatcaagg aaggtctgac cgaaacgacc 4800
taccgcgatg caggaatgag tgcacaatct catgagtatt gcgtggaagt taagtacaca 4860
gccggcgtat ctccgaaggt ttgtgtggat tatattcctg acggagtggc agacgtaacg 4920
gctcagaagc cttacacgct gacagttgta ggaaagacga tcacggtaac ttgccaaggc 4980
gaagctatga tctacgacat gaacggtcgt cgtctggcag ccggtcgcaa cacggttgtt 5040
tacacggctc agggcggcta ctatgcagtt atggttgtcg ttgacggcaa gtcttacgta 5100
gagaaactcg ctatcaagta a 5121
<212> Type : DNA 
<211> Length : 5121 
SequenceName : SEQ ID 559
SequenceDescription : 

Sequence 

<213> OrganismName : Porphyromonas gingivalis W83

<400> PreSequenceString :

atgaaaagaa aaccgctatt ctcagccctt gtaatccttt ccggcttctt cggatcggtt 60
cacccggcct cagcacagaa agttcctgca cccgtcgatg gcgagcgcat tatcatggag 120
ctaagtgaag ccgatgtgga gtgtacaatc aaaatagaag ccgaggatgg ctatgccaac 180
gacatttggg cagacctcaa cggaaacggc aagtacgatt cgggggagag gctcgattca 240
ggtgagtttc gtgatgttga gttcagacaa acaaaggcca tcgtctatgg caaaatggcc 300
aaattcttgt ttagaggttc ttctgcaggg gactatggtg ctacctttat agatattagc 360
aattgtaccg gcctgactgc tttcgactgc tttgccaatc tgctgacaga actcgatctg 420
tccaaagcaa acggtctgac ttttgtaaac tgcggcaaaa accagctgac caagcttgac 480
ctgcccgcaa atgcggacat tgagacgctg aactgctcca aaaacaagat aacgagtctc 540
aacctatcga cctataccaa gctgaaagag ctttatgtgg gcgacaacgg gctgacagcc 600
ttggatctct ccgccaatac gctcctcgaa gagctggtgt attctaacaa cgaggtgact 660
acgataaacc tgtctgccaa tacgaacttg aaaagcctgt attgcataaa caataagatg 720
accggactcg atgtcgcagc caacaaagag ctgaaaatac tccactgcaa caacaatcag 780
ctgaccgccc tcaatctctc ggccaatacc aagctgacga ctctaagctt cttcaacaac 840
gagctgacaa atatcgatct ctccgacaac acggctttgg agtggctttt ctgcaacggc 900
aataagctga cgaagttaga tgtatctgcc aacgccaatc tgatagcact gcaatgcagc 960
aacaaccagc tgactgctct ggatctgtca aaaacgccga aactgacaac gttgaattgc 1020
tactccaacc ggatcaaaga taccgccatg cgtgcattga tcgaaagcct gcctacgatc 1080
actgaaggag aaggcaggtt cgttccttac aacgacgatg aaggaggaga agaggagaac 1140
gtgtgtacaa ccgaacacgt ggaaatggcc aaggccaaga attggaaggt acttacctcg 1200
tggggagagc ctttccccgg aataacggct ttgatttcca tcgaaggtga gagcgaatat 1260
tccgtatatg ctcaagatgg catcctctac ctctccggta tggagcaggg cttgcccgtt 1320
caggtatata ccgtgggagg aagcatgatg tactcatctg tcgcttccgg atcagccatg 1380
gaaatacagc tcccgagagg tgcagcctat gtagtacgta tcggcagcca tgcgatcaaa 1440
accgcgatgc cgtaa 1455
<212> Type : DNA
<211> Length : 1455

SequenceName : SEQ ID 560
SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString :

atgaaacgag ctattaccct gtttgctgta ctgctgatgg gctggtcggt aaatgcctgg 60
tcattcgcct gtaaaaccgc caatggtacc gctatcccta ttggcggtgg cagcgctaat 120
gtttatgtaa accttgcgcc tgtcgtgaat gtggggcaaa acctggtcgt agatctttcg 180
acgcaaatct tttgccataa cgattatccg gaaaccatta cagactatgt cacactgcaa 240
cgaggctcgg cttacggcgg cgtgttatct aatttttccg ggaccgtaaa atatagtggc 300
agtagctatc catttccgac caccagcgaa acgccgcggg ttgtttataa ttcgagaacg 360
gataagccgt ggccggtggc gctttatttg acgcctgtga gcagtgcggg cggggtggct 420
attaaagctg gttcattaat tgccgtgctt attttgcgac agaccaacaa ctataacagc 480
gatgattttc agtttgtgtg gaatatttac gccaataatg atgtggtggt gcccactggc 540
ggttgtgatg tttctgctcg tgatgtcacc gttactctgc cggactaccc tggttcagtg 600
ccaattcctc ttaccgttta ttgtgcgaaa agccaaaacc tggggtatta cctctccggc 660
acaaccgcag atgcgggcaa ctcgattttc accaataccg cgtcgttttc accagcgcag 720
ggcgtcggcg tacagttgac gcgcaacggt acgattattc cagcgaataa cacggtatcg 780
ttaggagcag taggaacttc ggcggtaagt ctgggattaa cggcaaatta cgcacgtacc 840
ggagggcagg tgactgcagg gaatgtgcaa tcgattattg gcgtgacttt tgtttatcaa 900
taa 903
<212> Type : DNA
<211> Length : 903

SequenceName : SEQ ID 561
SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString :

atgggcatca aacaacacaa tgggaatacc aaagccgatc gtctcgctga attaaaaatc 60
cgttcgccct caattcaact gataaaattt ggcgctattg gtttgtatgc aattatcttt 120
tcccccctgc tgatagctgc tgatacagga agtcaatatg gcaccaatat tactattaat 180
gatggtgaca gaattacagg agataccgcc gatccatcag gaaacctcta tggtgtaatg 240
accccagcag gaaacacgcc tggcaatatc aacctgggta atgatgtcac cgtcaatgtc 300
aacgacgcct ctggatatgc aaaaggaatc attattcagg gcaaaaacag ctccctgaca 360
gctaaccgac tcacagtaga tgttgttggt caaacctctg ccatcggcat taacttaatt 420
ggtgactata cccatgctga cttaggcaca ggcagcacca ttaagagtaa cgatgacggc 480
atcattattg ggcatagctc aacactaaca gccactcaat tcaccattga aaactcgaac 540
ggtataggcc taaccatcaa tgactatggc accagtgtcg atctteggaag cggaagtaaa 600
atcaagaccg atggaagtac aggtgtttat atcggtggtc tcaacggcaa taacgccaat 660
ggtgctgcgc gttttacggc gacagacctg acaatcgatg ttcag^gcta cagcgccatg 720
gggataaacg tacagaaaaa ctctgttgtc gatctcggaa caaacagtac cattaaaacc 780
aatggcgata atgctcacgg cctctggagc tttggccagg ttagcgcgaa tgcactcact 840
gttgatgtaa ctggagccgc ggccaatggc gtcgaagttc gtggt^gtac aaccactatc 900
ggtgcagata gccatatttc ttccgcgcag ggcggtggcc tcgtcaccag tagttcagac 960
gcgacaatca atttttctgg cacggcagcg caacgaaaca gcatcttttc cggcggttct 1020
tatggtgcct cggcccagac ggcaacggct gttatcaaca tgcaaaatac cgatattacg 1080
gttgatcgta atggcagtct ggcgctgggt ttgtgggcgc tcagc^gcgg tagaataacc 1140
ggagacagtt tggctatcac cggcgcggca ggagccagag ggatt-fcatgc catgaccaac 1200
agccagatcg acctcacgag cgatctggtc attgatatga gtacacccga ccagatggcc 1260
atcgcaacgc aacatgacga tggttatgcc gccagccgca tcaacgcctc gggtcgtatg 1320
cttatcaacg gtagcgttct ttccaaaggt gggctaatca atctggatat gcaccctggg 1380
tcggtttgga caggttcctc cctcagcgat aatgtcaatg gcgggaaact ggacgttgca 1440
atgaataaca gcgtctggaa cgtaacaagt aattctaatc tcgacetcgct ggcgctgagc 1500
cattcaactg tcgattttgc cagccacggg tcaactgccg gcacatttac cacattaaac 1560
gtagagaacc tgagcggtaa cagtaccttt attatgcgtg ctgatcgttgt tggcgagggt 1620
aatggcgtta ataatagagg ggatttattg aatatcagcg ggagtsgtgc tggtaatcac 1680
gtattggcta tccgcaacca gggcagcgag gccacaacgg gaaat^aagt tctgacagtg 1740
gtaaaaacca ctgacggcgc ggcctcgttc agcgcgtctt ctcagcgttga gttgggggga 1800
tatctgtacg atgfcgcgtaa aaatggcact aactgggagc tttacgcttc cgggacagtt 1860
ccggaaccga ctcctaatcc tgaacccaca ccagctcccg ctcagcctcc catagtcaac 1920
cccgatccta cgcctgaacc cgctcccacg cctaaaccca ccacgaccgc agatgctggc 1980
ggcaattatc tcaatgtcgg ttacttattg aactatgttg aaaaccgtac gctgatgcaa 2040
cggatgggtg acttgcgaaa tcagagtaaa gacggtaata tctgg^-fctgcg cagttatggg 2100
ggaagcctgg actcctttgc cagtggcaaa ctgagcggct ttgac -a.fcggg ttacagcggt 2160
atccagtttg gtggggataa acgtctctct gatgtaatgc cgttgtatgt cggtctgtat 2220
attgactcaa cacatgcatc gccggactat agcggaggcg acggtaccgc acgttcagac 2280
tacatgggaa tgtacgccag ttacatggca caaaacggtt tttacagcga tctcgttata 2340
aaagcatcgc gccagaaaaa tagtttccac gtactggaca gtcagaacaa cggcgttaac 2400
gccaacggca ctgcgaatgg aatgagcatc tccctggaag ccgggcagag gttcaacctg 2460
tcccctactg gttatgggtt ctatatagag ccgcaaaccc agcttacata cagccaccag 2520
aatgagatgg ctatgaaggc gagtaatggc ctcaatatac atctgaatca ctacgaatcg 2580
ctgctggggc gtgccagcat gatactgggg tatgacatca ccgcaggcaa cagccagctg 2640
aatgtctatg tgaagactgg cgctatccgc gagttttcag gggataccga atatctgttg 2700
aacgactccc gggagaagta cagtttcaaa ggtaatggct ggaataacgg cgtgggagtc 2760
agtgcacagt ataacaaaca gcacacattc tatctcgaag cggattacac gcagggtaac 2820
ctctttgatc agaagcaagt caacggagga tatcgcttca gcttttaa 2868
<212> Type : DNA 
<211> Length : 2868 

SequenceName : SEQ ID 562

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString : 

atgagtaagt ttgttaaaac agctattgct gcggcaatgg tgatgggcgt gttcacctct 60
acggcaacaa tcgctgcggg taacaacggt actgcacgtt tttacrcjgcac cattgaagac 120
tcagtgtgtt ctatcgttcc ggacgatcac aaactggaag ttgatatggg tgatatcggt 180
gccgaaaaac tgaaaaataa cggcaccacc acgccgaaaa gtttccagat tcgtctgcaa 240
gattgcgtat ttgatactca ggaaacaatg accactacct ttactggtac cgtttcttct 300
gcaaatagcg gcaattatta caccattttc aataccgata ccggfccjcggc atttaacaat 360
gttagcctgg cgatcggtga ctctctgggt acctcttaca aaagcggcat gggtattgac 420
cagaaaatag tgaaagacac ttctaccaac aaaggcaaag cgaagcagac actgaacttt 480
aacgcctggc tggtcggcgc agctgatgcg ccagatctgg gtaattttga agcaaatacc 540
accttccaga ttacttacct gtaa 564
<212> Type : DNA 
<211> Length : 564

SequenceName : SEQ ID 563 
SequenceDescription : 



Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgaaaatta aaactctggc aatcgttgtt ctgtcggctc tgtccctcag ttccgcagcg 60
gctctggccg atactacgac ggtaaatggt gggaccattc actttaaagg ggaagttgtt 120
aacgccgctt gcgcagttga tgcaggctct gttgatcaaa ccgttcagtt gggacaggtt 180
cgtaccgcta gcctgaagca ggctggagca accagctctg ccgttggttt taacattcag 240
ctgaatgatt gcgataccac tgttgccaca aaagccgctg ttgccttctt aggtacggca 300
attgatgcta cgcgtactga tgtactggct ctgcagagtt ctgctgcagg tagtgcaaca 360
aacgttggcg tgcagatcct agacagaaca ggcaatgctc tgacgctgga cggtgcgaca 420
tttagtgcac aaacaaccct gaataacggt accaacacca ttccgttcca ggcgcgttat 480
tatgcaatcg gcgaggcaac cccgggtgca gctaatgcgg atgcaacctt caaggttcag 540
tatcaataa 549
<212> Type : DNA
<211> Length : 549

SequenceName : SEQ ID 564 
SequenceDescription : 






Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atggcaagta tttcatcgct gggagtcggg tcaggtctgg atttaagttc aatccttgat 60
agcctcaccg ccgcgcaaaa agcgacgcta acccctattt caaatcagca atcgtcgttt 120
accgctaaac ttagcgccta cggtacgctg aaaagcgcgc tgacgacttt ccagaccgcc 180
aatactgcat tgtctaaagc cgatcttttt tccgccacca gcaccaccag cagcaccacc 240
gcgttcagtg ccaccaccgc gggtaacgcc atcgccggga aatacaccat cagcgtcacc 300
catctggcgc aggcgcaaac cctgaccacg cgcaccacca gagatgatac gaaaacggcg 360
atcgccacca gcgacagcaa actcaccatt caacaaggcg acgacaaaga tccgattacc 420
attgatatca gcgcggctaa ctcatcgtta agcgggatcc gtgatgccat caacaacgca 480
aaagcgggcg taagtgcgag catcattaac gtgggtaacg gtgaatatcg tctgtcagtc 540
acatcaaatg acaccggcct tgataatgcg atgacactct cggtcagcgg tgatgatgcg 600
ctacaaagtt ttatgggcta tgacgccagt gccagcagca acggtatgga ggtctcggtc 660
gccgcccaga atgcgcagct gacggtcaac aacgtcgcca tcgagaacag cagcaacact 720
atcagcgacg cgctggaaaa catcactctg aacctgaacg atgtcaccac gggcaaccag 780
acgctaacca tcactcagga cacctccaaa gtgcaaacgg cgattaaaga ctgggtgaat 840
gcctataact cgctaataga taccttcagc agcctgacca aatacaccgc cgtagatgcg 900
ggagctgata gccagagttc tagcaatggc gcactgctcg gcgactccac gctgcggacg 960
attcagacgc agttgaagtc gatgctgagt aataccgtca gttcttccag ctataaaacg 1020
ctggcgcaga ttggtatcac gaccgatccc agcgatggca aactggaact ggatgccgac 1080
aaactcaccg ctgcactgaa aaaagatgcc agcggcgtag gtgcattgat tgttggcgat 1140
ggtaaaaaaa ccggcatcac aaccaccatc ggcagcaacc tgaccagttg gctttcgaca 1200
acgggcatta ttaaagccgc taccgatggc gttagtaaaa ccctgaataa gttaactaaa 1260
gactacaacg ccgccagcga tcgcattgat gcgcaggtcg cgcgctacaa agaacaattt 1320
acccaactgg acgttttaat gacctcgtta aacagcacca gcagctactt aacgcagcag 1380
ttcgaaaaca acagtaattc caagtaa 1407
<212> Type : DNA 
<211> Length : 1407 

SequenceName : SEQ ID 565 

SequenceDescription :



Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

<400> PreSequenceString :

gtggagggta aagctgataa tgtcgtactg gaaaatggcg gacgcctgga tgtgctgacc 60
ggacacacag ccactaatac ccgcgtggat gatggcggaa cgctggatgt ccgcaacggt 120
ggcaccgcca ccaccgtatc catgggaaat ggcggtgtac tgctggccga ttccggtgcc 180
gctgtcagtg gtacccggag cgacggaaag gcattcagta tcggaggcgg tcaggcggat 240
gccctgatgc tggaaaaagg cagttcattc acgctgaacg ccggtgatac ggccacggat 300
accacggtaa atggcggact gttcaccgcc aggggcggca cactggcggg caccaccacg 360
ctgaataacg gcgccatact taccctttcc gggaagacgg tgaacaacga taccctgacc 420
atccgtgaag gcgatgcact cctgcaggga ggcgctctca ccggtaacgg cagcgtggaa 480
aaatcaggaa gtggcacact cactgtcagc aacaccacac tcacccagaa agccgtcaac 540
ctgaatgaag gcacgctgac gctgaacgac agtaccgtca ccacggatgt cattgctcag 600
cgcggtacag ccctgaagct gaccggcagc actgtgctga acggtgccat tgaccccacg 660
aatgtcactc tcgcctctgg tgccacctgg aatattcccg ataacgccac ggtgcagtcg 720
gtggtggatg acctcagcca tgccggacag attcatttca cctccacccg cacagggaag 780
ttcgtaccgg caaccctgaa agtgaaaaac ctgaacggac agaatggcac catcagcctg 840
cgtgtacgcc cggatatggc acagaacaat gctgacagac tggtcattga cggtggcagg 900
gcaaccggaa aaaccatcct gaacctggtg aacgccggca acagtgcgtc ggggctggcg 960
accagcggta agggtattca ggtggtggaa gccattaacg gtgccaccac ggaggaaggg 1020
gcctttatcc aggggaataa gctgcaggcc ggtgccttta actactccct caaccgggac 1080
agtgatgaga gctggtatct gcgcagtgaa aatgcttatc gtgcagaagt ccccctgtat 1140
gcctccatgc tgacacaggc aatggactat gaccggattc tggcaggctc ccgcagccat 1200
cagaccggtg taagcggtga aaataacagc gtccgtctca gcattcaggg cggtcatctc 1260
ggtcacgata acaacggcgg tattgcccgt ggagccacgc cggaaagcag cggcagctat 1320
ggcttcgtcc gtctggaggg tgacctgctc agaacagagg ttgccggtat gtctgtgacc 1380
gcgggggtat atggtgctgc tggccattct tccgttgatg ttaaggatga tgacggctcc 1440
cgtgccggca cggtccggga tgatgccggc agcctgggcg gatacctgaa tctgatacac 1500
aacgcctccg gcctgtgggc tgacattgtg gcccagggaa cccgccacag catgaaagcc 1560
tcatcggaca ataacgactt ccgcgtccgg ggctggggct ggctgggttc gctggaaacc 1620
ggtctgccct tcagtatcac tgacaatctg atgctggagc cgcaactgca gtacacctgg 1680
cagggactct ccctggatga cggccaggat aacgccagtt atgtgaagtt cgggcatggc 1740
agtgcacaac atgtgcgtgc cggcttccgt ctgggcagcc accacgatat gaactttggt 1800
aaaggcacct catcccgtga caccctgcgc ggcagtgcca aacacagtgt gcgtgaactg 1860
ccggtgaact ggtgggtaca gccttctgtt atccgcacct tcagctcccg gggggacatg 1920
agcatgggta cagccgcagc cggcagtaac atgacgttct caccgtcaca gaatggtacg 1980
tcactggacc tgcaggccgg actggaagcc cgtgtccggg aaaatatcac cctgggcgtt 2040
caggccagtt atgcccacag catcaacggc agcagcgctg aaggttataa cagtcaggcc 2100
acactgaatg taaccttctg a 2121

<212> Type : DNA

<211> Length : 2121 

SequenceName : SEQ ID 566 
SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atggcctttt ctcaagcggt tagcggatta aacgctgccg ccaccaacct cgatgttatt 60
ggcaacaata tcgccaactc cgccacctac ggctttaaat caggcacggc ctcttttgcc 120
gatatgtttg ccggttcgaa agtgggactg ggggtaaaag ttgccggtat cactcaggac 180
tttaccgatg gcacgaccac caacaccggg cgaggtctgg acgttgctat cagccagaac 240
ggttttttcc gtctggtaga cagcaacggt tcggtgttct acagccgtaa cggacaattt 300
aagctggatg aaaaccgtaa cctgttgaat acgcaaggtt tacagctgac gggttacccg 360
gtaaccggta cgccgccgac tattcagcaa ggggcgaatc cgaccaatat ttcgatcccg 420
aataccctga tggcagcgaa aactaccacc acggcgtcga tgcagatcaa cctgaattcc 480
agtgatccgc ttcctactgt tacgccattc agcgccagca atgcggatag ctataacaaa 540
aaaggttcgg tgactgtttt cgacagtcag ggtaatgctc atgacatgag cgtctatttt 600
gtgaagaccg gggataataa ctggcaggtc tacacccagg atagcagtga tccaaacagc 660
attgcgaaga cagcgacaac actggaattt aatgctaatg gcacattagt ggatggtgcg 720
atggcgaata atatcgcaac cggcgcaatt aacggtgcag aacccgccac gtttagtctg 780
agcttcctca actccatgca gcaaaatacc ggcgctaaca acattgtggc aaccacccag 840
aatggctaca aaccgggcga tctggtgagt tatcaaatca atgatgacgg tacggttgtc 900
ggcaacaatt ccaacgaaca aacccaactg ctggggcaga ttgtactggc gaactttgcc 960
aacaacgaag gtctggcatc cgaaggcgac aacgtctggt ctgcgacgca atcttctggc 1020
gtggcgctgt tggggacagc cgggacggga aactttggca ctctgaccaa cggtgcgctg 1080
gaagcgtcca acgtcgatct cagtaaagaa ctggtcaata tgatcgttgc ccagcgtaac 1140
tataagtcta acgcccagac catcaaaacc caggaccaga tcctcaacac gcgggttaac 1200
ttacgctaa 1209

<212> Type : DNA

<211> Length : 1209 

SequenceName : SEQ ID 567 
SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString :
atgaaactcg tgcacatggc cagtggttta gcggttgcga ttgcgttggc ggcttgcgca 60
gataaaagcg cggatattca gacgccagcc ccggctgcaa atacgtctat ttcagcaaca 120
caacaacctg ctatccagca accgaatgtc tccggtactg tctggatccg tcagaaagtc 180
gcactgccgc ctgatgctgt gctgaccgtg acactttctg acgcgtcgtt agccgatgca 240
ccgtcaaaag tgctggcgca gaaagcggtg cgtactgaag gtaaacagtc accattcagc 300
tttgttctgc catttaaccc ggcagatgtt cagccgaacg cgcgtattct gttgagtgcg 360
gcgattaccg tgaatgacaa actggtattt atcaccgata ccgttcagcc ggtgatcaac 420
cagggcggaa ctaaagccga cctgacattg gtgccggtac agcaaaccgc cgtgccggtt 480
caggccagcg gtggcgcaac gactaccgta ccttcgactt caccaactca ggtgaatccg 540
tcttcggcag ttcccgctcc tacgcaatat taa 573
<212> Type : DNA
<211> Length : 573

SequenceName : SEQ ID 568
SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString :

atgatcataa aaaaaagcgg tggtcgctgg cagctaagcc tgctggcgag cgtggtaatc 60
agtgcctttt ttctcaacac agcttacgcc tggcaacaag aatatatcgt tgatacgcaa 120
cccggacatt ccacagagcg ttacacctgg gatagtgatc atcaacctga ttacaacgat 180
attttgtcgc aacgtattca aagtagccaa agggcgctgg gactggaagt caatctggcg 240
gaagaaactc ctgtggatgt gaccagcagt atgagtatgg gctggaattt tcctttatat 300
gaacaggtta caaccggccc ggtcgcggca ttacattacg atggcacaac cacctcgatg 360
tataacgagt ttggcgacag tactaccacg ctgaccgatc cgttatggca tgccagcgtg 420
agtagcttag gctggcgtgt tgactcccgg cttggcgatc tccgaccctg ggcgcaaatc 480
agctataacc agcaatttgg cgagaatatc tggaaggcgc aatcaggcct gagccggatg 540
acggcgacaa accagaacgg caactggctg gatgtcaccg taggcgctga tatgttgctc 600
aatcaaaata ttgccgccta tgctgcgcta actcaggcag aaaataccac taataatagc 660
gactatctgt atacgatggg ggttagcgcc agattttaa 699
<212> Type : DNA
<211> Length : 699

SequenceName : SEQ ID 569

SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString :

atgaaatggt gcaaacgtgg gtatgtattg gcggcaatgt tggcgctcgc aagtgcgacg 60
atacaggcag ccgatgtcac catcacggtg aacggtaagg tcgtcgccaa accgtgtacg 120
gtttccacca ccaatgccac ggtagatctc ggcgatcttt attctttcag tcttatgtct 180
gccggggcgg catcggcctg gcatgatgtt gcgcttgagt tgactaattg tccggtggga 240
acgtcgaggg tcactgccag cttcagcggg gcagccgaca gcaccggata ttataaaaac 300
caggggaccg cgcaaaacat ccagttagag ctacaggatg acagtggcaa cacattgaat 360
actggcgcaa ccaaaacagt tcaggtggat gattcctcac aatcagcgca cttcccgtta 420
caggtcagag cattgacggt aaatggcgga gccactcagg gaaccattca ggcagtgatt 480
agcatcacct atacctacag ctga 504
<212> Type : DNA
<211> Length : 504

SequenceName : SEQ ID 570

SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString :

atgaaaagag cgcctcttat aacaggactt ttgttgatat ccacatcctg cgcttatgcc 60
tcctcaggag ggtgtggagc cgacagcact agcggtgcga caaattacag cagtgtggtt 120
gatgatgtta cggtgaacca gacagataac gtgacaggac gggagtttac ctctgcaacg 180
ctaagtagca ctaactggca atacgcctgt tcctgctctg cgggtaaggc agttaaactt 240
gtctatatgg tcagccccgt acttaccacc actggacatc agacaggata ttacaaactc 300
aatgacagcc tggatattaa aaccacatta caggcaaacg acattccagg actcacaacc 360
gaccaggttg tctctgttaa cacccgattc acacagataa aaagcagcac cgtatattct 420
gctgcaaccc aaacgggtgt ttgccagggt gacacgtctc gttatggacc cgttaatatt 480
ggtgcaaata ccacctttac cctgtatgtc accaagccat ttctcggctc gatgaccatt 540
ccgaaaacgg atattgccgt cattaaaggc gcgtgggtcg atggaatggg aagcccgtct 600
acaggtgact tccatgattt agtcaagtta tcgattcagg gaaatctcac cgccccacag 660
tcgtgcaaaa ttaatcaggg cgatgttatt aaagttaatt ttggattcat caatggtcag 720
aagtttacca cccgcaatgc catgccagac ggttttactc cagtagactt tgatatcact 780
tatgactgtg gtgatacttc aaagattaaa aactcgttgc aaatgcgcat cgacggtaca 840
actggggtag tagaccagta caacctggtc gccaggcgaa gaagttcaga caatgtgccc 900
gatgtcggta ttcgtattga aaatctcggc ggcggagttg caaatattcc ttttcagaac 960
ggtatccttc ccgttgatcc ttccgggcat ggcaccgtca atatgcgcgc ctggccagtt 1020
aatctggtcg gtggtgagct ggaaacagga aaatttcagg gcacagccac cattaccgtc 1080
atggtgcggt aa 1092
<212> Type : DNA
<211> Length : 1092

SequenceName : SEQ ID 571

SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T

<400> PreSequenceString :

atgcaaaaaa acgctgcgca tacttatgcc atttccagct tgttggtgct ttcactaacc 60
ggctgcgcct ggataccctc cacgccgctg gtgcaggggg cgaccagtgc acaaccggtt 120
cccggtccga cgcccgtcgc caacggttct attttccagt ctgctcagcc gattaactat 180
ggctatcaac cgctgtttga agatcgtcga ccacgcaata ttggcgatac gctgaccatc 240
gtgttgcagg agaacgtcag cgccagcaaa agctcctctg cgaatgccag ccgtgacggt 300
aaaactaatt ttggctttga tactgtgccg cgctatttgc aggggctgtt tggtaacgct 360
cgtgccgatg tcgaagcctc cggtggtaac acgttcaacg gaaagggcgg ggccaatgcc 420
agcaatacct ttagcggcac gttgacggtg acggttgacc aggtactggt caacggcaac 480
ctgcatgtgg tgggtgaaaa acagattgcc attaatcagg gtaccgaatt tattcgcttc 540
tcgggcgtgg ttaatccacg cactatcagc ggcagcaata ccgtaccgtc tactcaggtg 600
gcggatgcgc gtattgaata cgtaggcaat ggctacatta acgaagcgca aaatatgggc 660
tggttgcagc gtttcttcct taacctgtcg ccaatgtaa 699
<212> Type : DNA
<211> Length : 699

SequenceName : SEQ ID 572

SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T

<400> PreSequenceString :

atgaaacgac atctgaatac ctgctacagg ctggtatgga atcacattac gggcgctttc 60
gtggttgcct ccgaactggc ccgcgcacag ggtaaacgtg gcggtgtggc ggttgcactg 120
tctcttgccg cggtcacgtc actcccggtg ctggctgctg acatcgttgt gcacccgggt 180
gaaacagtga atggcggaac actggtaaac catgacaacc agtttgtatc cggaacagct 240
gatggcgtga ctgtcagtac cgggcttgag ctggggccgg acagtgacga aaacaccggc 300
gggcaatgga taaaagcggg tggcacaggc agaaacacca ctgtcaccgc aaatggtcgt 360
cagattgtgc aggcaggagg aactgccagt gatacggtta ttcgtgatgg cggagggcag 420
agccttaacg gactggcggt gaacaccacg ctggataaca gaggtgagca gtgggtacac 480
gggggaggga aagctgccgg tacaattatt aaccaggatg gttaccagac cataaaacat 540
ggcggactgg caaccggaac catcgtcaac accggtgcag aaggtggtcc ggagtctgaa 600
aatgtgtcca gcggtcagat ggtcggaggg acggctgaat ccaccaccat caataaaaat 660
ggccggcagg ttatctggtc ttcggggatg gcacgggaca ccctcattta cgccggtggt 720
gaccagacgg tacacggaga ggcacataac acccgactgg aggggggtaa ccagtatgta 780
cacaacggtg gcacggcaac agagacgctg ataaaccgtg atggctggca ggtgattaag 840
gaaggaggaa ctgccgcgca taccaccatc aaccagaaag aaagctgcag gtga 894
<212> Type : DNA
<211> Length : 894

SequenceName : SEQ ID 573

SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString :

atgatgatga aaactattaa acatcttctg tgctgtgcca ttgccgccag cgcattaatt 60
tccaccgggg tgcatgctgc gtcctggaaa gatgcgctct ccagcgccgc cagcgaactt 120
ggcaaccaaa acagcacgac acaggaaggc ggttggtcgc tcgcgtcatt aactaacttg 180
cttagcagcg gaaaccaagc cttaagcgca gataacatga acaatgccgc aggcattctg 240
caatactgcg cgaagcaaaa gctggcttcg gtaaccgatg ccgaaaacat caagaaccag 300
gtgctggaaa agctgggcct gaacagtgaa gagcaaaaag aagacaccaa ctatctggac 360
ggtattcagg gtttgctgaa aacaaaagat ggtcagcaac tcaatctgga taacatcgga 420
acgactccgc tggcagaaaa ggtgaaaacc aaagcctgcg atctggtgtt aaagcagggg 480
ctgaacttca tttcctga 498
<212> Type : DNA
<211> Length : 498

SequenceName : SEQ ID 574

SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString :

atgtttaaag gacaaaaaac attggccgca ctggccgtat ctctgctgtt cacagcacct 60
gtttatgcgg ctgatgaagg atccggtgaa attcacttta aaggtgaagt tattgaagca 120
ccgtgtgaaa tacatcagga tgatattgat aaagaggttg aactcggtca ggtgaccacc 180
agccacatta atcagtcaca tcacagcgat gccgttgctg tcgacctgct cttagtcaac 240
tgtgatctgg aaaactccag caacggttcc ggtggcaaga tttctaaagt tgcagtcacc 300
tttgatagct cagcgaaaac caccggcgca gatccgattc tcaacaacac cagcacgggt 360
gaagccaccg gcgtcggcgt acgtttaatg aataaagatc aaagcaatat cgttttaggc 420
accgctactc cagatatcga cctggctccg acctccagcg aacaaacgct gaatttcttt 480
gcctggatgg aacaaattga tcaggctaca cctgtaacgc caggcgcagt tacggcgaat 540
gcaacctacg tgctggatta taaataa 567
<212> Type : DNA
<211> Length : 567

SequenceName : SEQ ID 575

SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T

<400> PreSequenceString :

atgagcgcgg gaagcccaaa attcaccgtt
ctatggctgg caggctgttc tgacacttca
ggcaatgcgc ctgcaaatac taattctggt
acgacgtcta cagcgcagca accgcaaatt
actcaacaac cgcaaatcca gccagtgcag
aacggacgca tcgtctataa ccgtcagtat
agtacctata ccgtgaaaaa aggcgacaca
gatttccgtg accttgctca gcgcaacaat
cagaccttgc aggtgggtaa tgcttccggt
caggccgacg cagcagagca aggagttgtg
gttgcgtcgc aaccgacaat tacgtattct
atgttgccga acaacaagcc aactgcgacc
gcaagcacaa ccgagccgac tgtcagcagt
cgctggccga ctgagggcaa agtgatcgaa
gggattgata tcgcaggcag caaaggacag
gtttatgctg gtaacgcgct gcgcggctac
gattacctga gtgcctacgc ccataacgac
aaggcggggc aaaaaatagc aaccatgggt
tttgaaattc gttacaaggg gaaatccgta

<212> Type : DNA
<211> Length : 1140

SequenceName : SEQ ID 576
SequenceDescription :

Sequence

<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString :

gtgattaaat ttctctctgc attaattctt ctactggtta cgacggcggc tcaggctgag 60
cgtattcgcg atctcaccag tgttcagggg gtaaggcaaa actcactgat tggctatggc 120



cgccgcattg cggctttgtc actggtttcg 60 

aatccaccgg caccggtcag ctccgttaat 120 

atgttgatta cgccgccgcc gaaaatgggg 180 

cagccggtac agcagccaca aattcaggct 240 

ccagtagctc agcagccggt acagatggaa 3 00 

gggaacattc cgaaaggcag ttatagcggc 3 60 

cttttctata tcgcctggat tactggcaac 420 

attcaggcac catacgcgct gaacgttggt 480 

acgccaatca ctggcggaaa tgccattacc 540 

atcaagcctg cacaaaattc caccgttgct 60 0 

gagtcttcgg gtgaacagag tgctaacaaa 660 

acggtcacag cgcctgtaac ggtaccaaca 72 0 

acatcaacca gtacgcctat ctccacctgg 780 

acctttggcg cttctgaggg gggcaacaag 840 

gcaattatcg cgaccgcaga tggccgcgtt 900 

ggtaatctga ttatcatcaa acataatgat 960 

acaatgctgg tccgggaaca acaagaagtg 1020 

agcaccggaa ccagttcaac acgcttgcat 10 80 

aacccgctgc gttatttgcc gcagcgataa 114 0 
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ttggtagtgg ggctggatgg cactggtgac cagacaaccc agacgccgtt taccacacaa 180 

acgcttaata acatgctctc acagctggga attaccgttc cgacgggcac caatatgcag 240

ctaaaaaacg tcgctgcggt aatggtgaca gcgtcacttc ctccgtttgg acgtcagggg 300

caaaccatcg atgtggtggt ttcttccatg ggtaatgcca aaagcttgcg tggaggtacg 360

ttgttgatga caccgcttaa gggcgttgac agtcaggtgt atgcgctggc gcagggcaat 420

attctggttg gcggcgcagg agcctccgca ggcggtagca gtgttcaggt taaccaactg 480

aacggtggac ggatcaccaa tggtgcggtt attgaacgtg aattgcccag ccagtttggc 540 

gtcgggaata cccttaattt gcaacttaac gacgaagatt tcagcatggc gcagcaaatc 600 

gctgacacca tcaaccgcgt gcgtggatat ggcagcgcca ccgcgttaga tgcgcggact 660 

attcaggtgc gcgtaccaag tggcaacagt tcccaggtcc gtttccttgc cgatatccag 720

aatatgcagg ttaatgtcac cccgcaggac gctaaagtag tgattaactc gcgcaccggt 780 

tcggtggtga tgaatcgcga agtgaccctc gacagctgcg cggtagcgca gggaaatctc 840 

tcagtaacag ttaatcgtca ggccaatgtc agccagccag atacaccgtt tggtggtgga 900 

cagactgttg ttactccaca aacgcagatc gatttacgcc agagcggcgg ttcgctgcaa 960

agcgtacgtt ccagcgccag cctcaataac gtggtgcgtg cgctcaatgc gctgggcgct 1020

acgccgatgg atctgatgtc tattttgcaa tcaatgcaaa gtgcgggatg tctgcgggca 1080

aaactggaaa tcatctga 1098 
<212> Type : DNA 
<211> Length : 1098 

SequenceName : SEQ ID 577 

SequenceDescription :



Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgaaacgtt caattattgc tgccgctgtc ttttcttctt tttttatgag cgctggagta 60 

tttgctgcag acgttgatac cggaacatta accattaaag gaaatattgc agaatctccg 120

tgtaaattcg aagcgggtgg tgattcagta agtattaata tgccgactgt accaaccact 180 

gtctttgaag gtaaagctaa atattctacc tatgatgatg cagtcggtgt aaccagcagc 240 

atgttaaaaa ttagctgccc gaaagaagtt gctggtgtaa aactctcgtt gatcacaaac 300 

gacaaaataa ccggtaacga taaggcgata gccagtagca acgataccgt aggtgataac 360

agcgatgtcc tagatgtttc tgcacctttt aacattgaga gttataaaac agcggaaggt 420 

caatatgcta ttccgtttaa agcaaaatac ctgaaactga cagataactc agtgcaatca 480 
ggtgatgtgt tatcttctct ggttatgcgt gtggcgcagg attaa 525 
<212> Type : DNA 
<211> Length : 525 

SequenceName : SEQ ID 578 

SequenceDescription :

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atggcggttc aaaagaatgt tatcaaaggc atactggcag gtacgtttgc gctaatgctg 60 

agcggttgtg tcactgtgcc ggacgccatt aaaggcagca gtaccacgcc gcaacaagat 120

ttagttcggg tgatgagtgc gccgcagctg tacgttggtc aggaggcacg ctttggtggc 180 

aaagtggttg cggtacaaaa ccagcaaggg aaaacccgcc tggaaattgc taccgtaccg 240 

ctggacagcg gagctaggcc gacgctggga gaaccttctc gcggtcgcat ttatgccgat 300

gtgaacggtt ttctggaccc ggtggatttc cgtggacaac tggttacggt agtcgggcca 360 

atcaccggtg cggttgacgg caaaatcggc aatacgccct ataaatttat ggtgatgcaa 420

gtaacgggtt acaaacgttg gcatttaacc cagcaggtga ttatgccgcc tcagccgatt 480

gatccatggt tttatggcgg tcgtggctgg ccctatggct acggcggatg gggctggtat 540 
aatcccggcc ccgcgagagt acaaacagtt gtaaccgaat aa 582 
<212> Type : DNA 
<211> Length : 582 

SequenceName : SEQ ID 579 

SequenceDescription : 



Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString :

atgagaaaca aaccttttta tcttctgtgc gcttttttgt ggctggcggt aagtcgcgtt 60 

ttggctgcgg atagcacgat tactatccgc ggctatgtca gagataacgg ctgtagtgtg 120 

gccgctgaat caaccaattt tactgttgat ctgatggaaa acgcggcgaa gcaatttaac 180

aacattggcg cgacgactcc tgtcgttcca tttcgtattt tgctgtcacc ctgtggtaat 240 
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gccgtttctg ccgtaaaagt tgggtttacc ggcgttgcag atagccacaa. tgccaacctg 300 

cttgcacttg aaaatacggt gtcagcggct gcgggactgg gaatacagct tctgaatgag 360

cagcaaaatc aaatacccct taatgctcca tcgtctgcga tttcgtggac gaccctgacg 420

ccgggtaaac caaatacgct gaatttttac gcccggctaa tggcgacaca ggtgcctgtc 480

actgcggggc atatcaatgc tacggctacc ttcactcttg aatatcagta a 531 

<212> Type : DNA 
<211> Length : 531 

SequenceName : SEQ ID 580

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString :

atgaaaaagt taacagtggc ggctttggca gtaacaactc ttctttctgg cagtgccttt 60 

gcgcatgaag caggcgaatt ttttatgcgt gcaggttctg caaccgtacg tccaacagaa 120

ggtgctggtg gtacgttagg aagtctgggt ggattcagcg tgaccaataa cactcaactg 180

ggcttgacgt ttacttatat ggcgaccgac aacattggtg tggaattact ggcagcgacg 240

ccgttccgcc ataaaatcgg cacccgggcg accggcgata ttgcaaccgt tcaccatctg 300

ccgccaacac tgatggcgca gtggtatttt ggtgatgcca gcagcaaatt ccgtccttac 360

gttggggccg gtattaacta caccaccttc tttgataatg gatttaacga tcatggcaaa 420

gaggcggggc tttccgatct cagtctgaaa gattcctggg gagctgccgg gcaggtgggg 480

gttgattatc tgattaaccg tgactggttg gttaacatgt cagtgtggta catggatatc 540

gataccaccg ccaattataa gctaggcggt gcacagcaac acgatagcgt acgcctcgat 600

ccgtgggtgt ttatgttctc agcaggatat cgtttttaa 639
<212> Type : DNA 
<211> Length : 639 

SequenceName : SEQ ID 581 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

ttgtttttta agcgaggaaa gattttgagt gcgggacgcc tgaataaaaa atctctgggt 60

atcgtgatgt ttttatcggt tggactgcta ttggcgggct gttcgggtag caaatcatcc 120

gatacaggaa cgtattccgg ctccgtttac accgtgaaac ggggggatac gctatatcgt 180

atttcgcgca ccacgggaac cagcgtaaaa gagctggcgc gactgaacgg catttccccc 240

ccttacacca ttgaagttgg tcagaaacta aaactgggtg gggcgaaaag tagcagtagt 300

acacgtaaat caaccgccaa atcaacgact aaaaccgcat cggttacacc gtcatcagcg 360

gtaccgaaat cttcctggcc gccagtaggg caacgttgtt ggttatggcc aacgacaggg 420

aaagttatca tgccgtattc gacagcagat ggcggcaata aagggattga tatctcagct 480

ccacggggta cacctattta cgccgcgggt gcaggaaagg tggtgtatgt gggcaaccag 540

ctgcgtggct acggtaatct catcatgatt aaacacagtg aagattacat tacggcttac 600

gcccataatg acacgatgct ggtaaataat gggcaaagcg tgaaggctgg gcaaaaaatc 660

gccactatgg ggagcacgga tgcggcatct gttcgcctgc atttccagat tcgttaccgt 720

gcaacggcaa ttgatccgct acgttacttg ccgccgcagg gcagcaagcc aaaatgctga 780

<212> Type : DNA 
<211> Length : 780 

SequenceName : SEQ ID 582 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60

aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120

gcgaaggatg acgccgcggg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180

ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcacagac cactgaaggc 240

gcgctgtccg aaatcaacaa caacttacag cgtatccgtg agctgacggt tcaggcttct 300

accgggacta actctgattc ggatctggac tccattcagg acgaaatcaa atcccgtctc 360

gacgaaattg accgcgtatc cggtcagacc cagttcaacg gcgtgaacgt actggcaaaa 420

gacggttcga tgaaaattca ggttggtgcg aatgacggcc agactatcac tattgatctg 480

aagaaaattg actctgatac gctggggctg aatgggttta atgtgaatgg tggcggggct 540
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gtggctaata ctgctgcatc taaagctgac ttggtagctg ctaatgcaac agtggtcggc 60 0 

aacaaatata ctgtgagtgc gggttacgat gctgctaaag cgtctgattt gctggctgga 660 

gttagtgatg gtgatactgt tcaggcaacc attaataacg gcttcggaac ggcggctagt 720

gcaacgaatt acaagtatga cagtgcaagt aagtcatact cttttgatac cacaacggct 780

tcagctgccg atgttcagaa atatttgacc ccgggagttg gtgataccgc taagggcact 840

attactatcg atggttctgc acaggatgtt cagatcagca gtgatggtaa aattacggca 900

agcaatggag ataaactcta cattgataca actgggcgct taacgaaaaa cggctctggt 960

gcttctttga ctgaggctag tctgtccaca cttgcagcca ataataccaa agcgacaacc 1020

attgacattg gcggtacctc tatctccttt accggtaata gtactacgcc ggacactatt 1080

acttattcag taacaggtgc aaaagttgat caggcagctt tcgataaagc tgtatcaacc 1140

tcgggaaaca atgttgattt cactactgca ggttatagcg tcaacggcac aactggcgct 1200

gtaacaaaag gtgttgattc ggtttatgtt gataacaacg aggcgttgac cacatctgat 1260

actgtagatt tttacctaca ggatgatggt tcagtgacta acggcagcgg taaggcagtt 1320

tataaagatg ctgatggtaa attgacgaca gatgctgaaa caaaagcagc tacgacagcc 1380

gatcccctga aagctctgga tgaagccatc agctccatcg acaaattccg ctcctccctg 1440 

ggtgcggtgc aaaaccgttt ggattccgca gtcaccaacc tgaacaacac cactaccaac 1500 

ctgtctgaag cgcagtcccg tattcaggac gccgactatg cgaccgaagt gtccaacatg 1560 

tcgaaagcgc agattatcca gcaggccggt aactccgtgc tggcaaaagc caaccaggta 1620 

ccgcagcagg ttctgtctct gctgcagggt taa 1653 
<212> Type : DNA 
<211> Length : 1653 

SequenceName : SEQ ID 583 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgaaaaaaa ttgcatgtct ttcagcactg gccgcagttc tggctttcac cgcaggtact 60 
tccgtagctg cgacttctac cgtaactggc ggttacgcac agagcgacgc tcagggccaa 120

atgaacaaaa tgggcggttt caacctgaaa taccgctatg aagaagacaa cagcccgctg 180 
ggtgtgatcg gttctttcac ttacaccgag aaaagccgta ctgcaagctc tggtgactac 240 
aacaaaaacc agtactacgg catcactgct ggtccggctt accgcattaa cgactgggca 300

agcatctacg gtgtagtggg tgtgggttat ggtaaattcc agaccactga atacccgacc 360 
tacaaacacg acaccagcga ctacggtttc tcctacggtg ctggtctgca gttcaacccg 420 
atggaaaacg ttgctctgga cttctcttac gagcagagcc gtattcgtag cgttgacgta 480 
ggcacctgga ttgccggtgt tggttaccgc ttctaa 516 
<212> Type : DNA 
<211> Length : 516 

SequenceName : SEQ ID 584 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgaagcgta atattatagg cggtgcattc actctggcat ctctaatgct ggccgggcat 60 

gcactggcag aagatggtgt tgttaacttc gtcggtgaaa ttgtcgacac tacttgtgaa 120

gttacctccg atacagccga tcaaattgtc ccactgggta aagtcagtaa aaatgcattt 180

tcaggtgtag gtagtctggc gtcgccacag aagttcagta ttaaacttga aaattgcccg 240

gcaacgtaca ctcaagcagc cgttcgtttt gatggtacag aagcgcctgg cggcgacggc 300

gacctgaaag tgggtacgcc gcttacagca ggcaaccctg gtgattttac cggtacagga 360

caagcgattg cggcaaccgg cgttggtatt cgtattttta accagtccga taattcgcag 420

gttaaacttt ataacgactc tgcttatacc gctatcgatg ctgaaggcaa ggctgaaatg 480

aagtttattg cacgctatgt ggcaaccaat gcgaccgtaa cggctggtac ggcgaacgct 540

gattcacaat ttactgtcga atataagaaa taa 573 
<212> Type : DNA 
<211> Length : 573 

SequenceName : SEQ ID 585 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString :

atgaaaaaga gcactctggc attagtggtg atgggcattg tggcatctgc atctgtacag 60
gctgcagaaa tatataataa agacggtaat aaactggatg tctatggcaa agttaaagcc 120
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atgcattata tgagtgataa cgccagtaaa gatggcgacc agagttatat ccgttttggt 18 0 

tttaaaggtg aaactcaaat taacgatcaa ctgactggtt atggtcggtg ggaagcggag 240
tttgccggaa ataaagcgga gagtgatact gcacagcaaa aaacgcgtct cgcttttgcc 300
gggttgaaat ataaagattt gggttctttc gattatggtc gtaacctggg cgcgttgtat 360
gacgtggaag cctggaccga tatgttcccg gaatttggtg gcgattcctc ggcgcagacc 420
gacaacttta tgaccaaacg cgccagcggt ctggcgacgt atcggaacac cgacttcttc 480
ggcgttatcg atggcctgaa cttaaccctg caatatcaag ggaaaaacga aaaccgcgac 540
gttaaaaagc aaaacggcga tggcttcggc acgtcattga catatgactt tggcggcagc 600
gatttcgcca ttagtggggc ctataccaac tcagatcgca ccaacgagca gaacctgcaa 660
agccgtggca caggcaagcg tgcagaagca tgggcaacag gtctgaaata cgatgccaat 720
aatatttatc tggcaacttt ctattctgaa acacgcaaaa tgacgccaat aactggcggc 780
tttgccaata agacacagaa ctttgaagcg gtcgctcaat accagtttga ctttggtctg 840
cgtccatcgc tgggttatgt cttatcgaaa gggaaagata ttgagggcat cggtgatgaa 900
gatctggtca attatatcga cgtcggtgct acgtattatt tcaacaaaaa tatgtcagcg 960
tttgttgatt ataaaatcaa ccaactggat agcgataata agctgaatat taataatgat 1020
gatactgtcg cggttgggat gacgtatcag ttttaa 1056

<212> Type : DNA 
<211> Length : 1056 

SequenceName : SEQ ID 586 

SequenceDescription :

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

<400> PreSequenceString :

atgcgtaagc agtggctcgg gatctgcatc gcggcaggaa tgctcgcggc atgtacaagc 60 

gatgatggtc agcaacagac ggtaagtgta ccgcagcctg cggtatgtaa cggccccata 120
gttgaaatta gcggggcgga cccgcgtttc gaaccactga acgcgacggc aaatcaggat 180
taccagcgcg acggtaaaag ctacaaaatc gtgcaggatc cgtctcgatt tagccaggcg 240
ggactggcgg caatctatga tgccgaacca ggcagtaacc tgacggcctc tggcgaagct 300
ttcgatccga caaagctgac ggcggcccat ccaacgcttc cgatccccag ctacgccaga 360
atcactaacc tggctaacgg gcgaatgatc gtggtgcgca ttaatgatcg cggcccttac 420
ggcaacgacc gcgttatttc gctttctcgc gcggcagctg accgtcttaa cacgtcaaac 480
aacaccaaag ttcgtatcga tccgattatt gtcgcccagg atggttcgct ttctggtcct 540
ggtatggctt gtaccacagt cgccaaacag acttacgccc tgcctgcacc tcccgattta 600
agcggtggcg cgggaacaag ttcagtgtct ggcccgcagg gtgacattct tccggtcagt 660
aattcgacgc taaaaagcga agatccgacc ggcgcgccgg taaccagtag cggtttcctc 720
ggcgctccaa cgaccttagc gcctggtgta ctggaaggpa gcgaaccgac gcctgcgcca 780
cagcccgttg ttacagcttc gtcgacaacg cctgcaacct cgcctgcaat ggtgacaccg 840
caagccgcct cgcaaagcgc cagcggcaac tttatggtgc aagtcggggc cgtaagcgat 900
caggctcgtg cgcaacagta ccaacagcaa ctgggacaga agttcggcgt ccccggtcgc 960
gtaactcaaa atggcgcggt ctggcggatc cagcttggcc cattcgccag caaagccgaa 1020
gccagtacct tgcagcaacg tttgcaaacc gaagcccaat tacagtcatt tattactacc 1080
gcgcagtag 1089

<212> Type : DNA

<211> Length : 1089 

SequenceName : SEQ ID 587 
SequenceDescription : 

Sequence



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString :

atgaagaaaa aaacgatata tcagtgcgtt attttgttct ttagccttct taacatccat 60 
gtcgggatgg ctgggcctga acaagtcagt atgcatattt atgggaatgt ggtcgatcag 120
ggctgtgatg tcgccaccaa aagtgcatta caaaatattc atattggtga ttttaatatc 180
agtgattttc aggccgcgaa taccgtaagc actgctgctg atttgaatat tgatatcacc 240
ggttgtgccg ctggtattac tggcgcggac gtccttttta gcggcgaggc tgacaccctt 300
gcgccgacac tgctcaaact aactgacaca ggcggaagcg gtggtatggc aacggggatt 360
gccgtgcaaa ttcttgatgc gcaaagtcag caagaaatcc cgctcaatca ggtccagcct 420
cttacgccct taaaagccgg ggataacaca ctcaaatatc aacttcgtta taagtccaca 480
aaggcgggag caacgggcgg taatgcgacg gcggttctct attttgatct ggtttaccag 540
tga 543

<212> Type : DNA
<211> Length : 543

SequenceName : SEQ ID 588
SequenceDescription :

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Sequence 

<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString :

atgaaaaaca aattgttatt tatgatgtta acaatactgg gtgcgcctgg gattgcagcc 60
gcagcaggtt atgatttagc taattcagaa tataacttcg cggtaaatga attgagtaag 120
tcttcattta atcaggcagc cataattggt caggctggga ctaataatag tgctcagtta 180
cggcagggag gctcaaaact tttggcggtt gttgcgcaag aaggtagtag caatcgggca 240
aagattgacc agacaggaga ttataacctt gcatatattg atcaggcggg cagtgccaac 300
gatgccagta tttcgcaagg tgcttatggt aatactgcga tgattatcca gaaaggttct 360
ggtaataaag caaatattac acagtatggt actcaaaaaa cggcagttgt agtgcagaga 420
cagtcgcaaa tggcaattcg cgtgacacaa cgttaa 456

<212> Type : DNA
<211> Length : 456

SequenceName : SEQ ID 589 
SequenceDescription : 

Sequence 

<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString :

gtgatgaaat ttaaaaaatg tcttctgcct gtggcaatgt tagcgtcatt cactctggca 60 
ggatgccagt caaatgctga cgatcatgcc gccgatgttt atcaaaccga tcaactgaat 120 

accaaacaag aaactaaaac cgttaatatt atttccattc ttcccgcaaa agttgccgta 180
gacaactccc aaaataaacg gaacgcacaa gccttcggcg cgcttattgg cgcagtcgct 240
ggcggtgtta tcggccacaa cgtcggttct ggcagcaatt ccggaacgac ggcaggggca 300
gttggcggcg gagctgtagg cgcggcagcg ggttctatgg tgaatgataa aaccttagtg 360
gaaggtgttt ctttaacata taaggaaggc accaaagtgt atacctccac ccaggagggt 420
aaagagtgcc agtttacgac aggtttagcc gttgttatta ccacaacgta taacgaaacg 480
cgtattcagc caaataccaa atgtcctgaa aagagctaa 519 
<212> Type : DNA 
<211> Length : 519 

SequenceName : SEQ ID 590 

SequenceDescription :

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

<400> PreSequenceString :

atgcaaacga aaaaaaatga aatttgggtg ggtatctttt tattagcagc actgctggcg 60 
gcgctgtttg tttgcctgaa ggcggcgaac gtgacgtcca tacgtactga atcgacctac 120 
acgctttatg cgacgttcga taacattggc ggcctgaaag cgcgctctcc ggtcagcatt 180 
ggtggtgttg ttgtgggccg ggtggcggat attacgctgg acccgaaaac ctatctgccg 240 

cgcgtaacgc tggaaattga acaacgttat aaccacattc ctgataccag ttcgctgagc 300
attcgtactt ccggcctgct gggagaacaa tatctggcat taaacgtcgg ttttgaagac 360
ccggaactgg ggactgctat cctgaaggat ggcgatacaa ttcaggacac caagtctgcg 420
atggtgctgg aagatctcat tggtcagttc ctttacggta gtaaaggcga tgacaataag 480
aatagtggcg atgcgccagc tgctgcgcca ggtaataatg aaaccactga acctgtgggt 540
acaacgaaat aa 552
<212> Type : DNA 
<211> Length : 552 

SequenceName : SEQ ID 591 
SequenceDescription : 


Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString :

atggcaccgt tagccttttc tgcacaatca ttggctgaat cattaacggt ggaacaacgc 60
cttgagttat tagaaaaagc gttaagagaa acgcaaagcg aactcaaaaa gtataaagat 120
gaagagaaga aaaaatatac gccagcgacg gtgaatcgta gcgtaagtac gaatgatcaa 180
gggtatgccg ccaatccgtt cccgaccagt agtgccgcaa aacctgatgc tgtactggtc 240
aaaaatgaag agaaaaatgc cagtgagaca ggctcgattt attcttccat gactctgaaa 300
gatttcagta agtttgtgaa agatgaaatt ggctttagtt acaacggcta ttaccgttct 360
ggttggggga ccgcctctca tggttcacct aaatcatggg cgattggttc tctgggccgc 420
tttggtaacg aatactccgg ctggtttgat ttgcagttaa aacaacgtgt ctacaacgaa 480
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aacggcaaac gggttgatgc cgttgtgatg atagatggta acgttggtca gcagtactct 540 

accggctggt ttggcgataa tgccggtggc gagaacttta tgcagttctc cgatatgtac 600 

gttaccacca aaggtttcct gccctttgcg ccagaggctg atttctgggt gggtaaacac 660 

ggtgcgccga aaattgaaat ccagatgctg gactggaaaa cgcagcgtac tgatgctgca 720
gcgggtgtag gtctggaaaa ctggaaagtc ggtccgggta aaattgatat cgcgctggtt 780
cgcgaagatg tcgatgatta cgatcgcagc ctgcaaaaca aacagcagat taatacccat 840
actattgatt tacgctataa agatatcccg ttatgggata aagcgacctt aatggtgagt 900
ggtcgttatg tcacggcaaa cgaaagcgca tcggaaaaag ataatcagga taataacggg 960
tattatgact ggaaagatac ctggatgttc ggcacatctt taacgcagaa atttgataaa 1020
ggtggcttca acgaattctc cttcctggtt gcgaataact ctatcgccag gaactttggc 1080
cgttatgctg gcgcaagtcc atttaccacc tttaatggtc gttattatgg tgatcacacc 1140
ggcggaacag cagttcgtct gacctcgcag ggcgaagcct atatcggcga tcattttatt 1200
gtggctaacg cgattgttta ctccttcggt aacaatattt atagctacga aacaggcgca 1260
cactctgatt tcgaatctat tcgtgcggtt gttcgcccgg cctatatttg ggaccaatat 1320
aaccagacag gtgttgaact gggctatttc acccagcaaa acaaagatgc gaatagtaat 1380
aaatttaatg agtctggtta taaaaccacg ctcttccata cctttaaagt caataccagt 1440
atgttgacct cgcgtctgga aattcgtttc tacgccacgt atatcaaagc cctggaaaac 1500
gaactggatg gcttcacctt cgaagacaat aaagacgccc agtttgctgt cggtgcccag 1560
gctgaaatct ggtggtaa 1578

<212> Type : DNA
<211> Length : 1578

SequenceName : SEQ ID 592
SequenceDescription :

Sequence

<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString : 

atgaaaaaaa gaattttatc agcagtttta gtgagtggtg taactcttag ttctgcgaca 60 

acattatcag ctgtaaaagc tgatgacttt gatgcgcaga ttgcgtcaca agattctaaa 120
atcaacaact tgaccgcaca acagcaagca gcacaagcac aagttaatac gattcaagga 180
caagtaagtg ctttacagac acaacaagct gaattacaag ctgaaaatca aagacttgaa 240
gctcagtctg ctactttggg tcaacaaatt caaacacttt caagcaaaat tgttgcacgt 300
aatgaatctt tgaagcaaca agctcgtagt gctcaaaaaa gtaacgcagc taccagctat 360
attaatgcta tcattaattc aaaatcagtt tctgatgcta ttaatcgtgt ttcggctatt 420
cgtgaagttg tatctgctaa tgaaaaaatg cttcaacaac aagagcaaga taaagcagct 480
gttgagcaaa agcaacaaga aaatcaagca gcaattaata ctgttgcagc taatcaggag 540
acaattgctc aaaatacaaa tgctttaaat acacagcaag ctcaattaga agcagcacaa 600
ctaaacttgc aagctgaatt gactactgca caagatcaaa aagctacttt agttgctcaa 660
aaagcggcag cagaggaagc tgcacgccaa gcagcagcag cacaagcggc agcagaagct 720
aaggccgcag cagaagcgaa agctttacaa gaacaagcag cgcaagcaca agcagcagca 780
aataataata ctcaagctac agatgtttct gaccaacaag cagcggcagc tgataacact 840
caagcagcac aaacaggtga ttcaactgag cagtcagcag cacaagcagt aaataattct 900
gatcaagaaa gtactacagc aacagaagca caaccatcag cttctagtgc ttcgacagct 960
gctgtagcag ctaatacttc ttctgctaat acatatccag cagggcaatg tacttggggt 1020
gttaaatcat tagctccttg ggtaggaaac tactggggta atggtggaca atgggcagca 1080
agtgcagcag cggcaggata tagagttggt tctacacctt cagctggagc tgtagctgta 1140
tggaatgatg gcggttatgg acacgttgct tatgttacag gtgttcaagg tggccaaatt 1200
caagttcaag aagctaacta tgcaggtaac caatctattg gtaactaccg tggttggttt 1260
aatccaggta gtgtaagcta tatctatcca aactaa 1296
<212> Type : DNA 
<211> Length : 1296 

SequenceName : SEQ ID 593
SequenceDescription : 


Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString :

atgaaagtca aaaaaactta cggttttcgt aaaagtaaaa ttagtaaaac actgtgtggt 60
gctgttctag gaacagtagc agcagtctct gtagcaggac aaaaggtttt tgccgatgaa 120
acgaccacta ctagtgatgt agatactaaa gtagttggaa cacaaactgg aaatccagcg 180
accaatttgc cagaggctca agggagtgcg agtaaggaag ctgaacaaag tcaaaaccaa 240
gctggagaga caaatggttc aataccagtt gaagtaccta aaactgatct tgatcaagca 300
gcaaaagatg ctaagtctgc tggtgtcaat gttgtccaag atgccgatgt taataaagga 360
actgttaaaa cagctgaaga agcagtccaa aaagaaactg aaattaaaga agattacaca 420
aaacaagctg aggatattaa gaagacaaca gatcaatata aatcggatgt agctgctcat 480






gaggcagaag ttgctaaaat caaagctaaa aatcaggcaa ctaaagaaca gtatgaaaaa 540
gatatggcag ctcataaagc cgaggttgaa cgcattaatg ctgcaaatgc tgccagtaaa 600
acagcttatg aagctaaatt ggctcaatat caagcagatt tagcagccgt tcaaaaaacc 660
aatgctgcca atcaagcagc ctatcaaaaa gcccttgctg cttatcaggc tgaactgaaa 720
cgtgttcagg aagctaatgc agccgccaaa gccgcttatg atactgctgt agcagcaaat 780
aatgctaaaa atacagaaat tgccgctgcc aatgaagaaa ttagaaaacg caatgcaacg 840
gccaaagctg aatatgagac taagttagct caatatcaag ctgaactaaa gcgtgttcag 900
gaagctaatg ccgcaaacga agcagactat caagctaaat tgaccgccta tcaaacagag 960
cttgctcgcg ttcaaaaagc caatgcggat gctaaagcgg cctatgaagc agctgtagca 1020
gcaaataatg ccaaaaatgc ggcactcaca gctgaaaata ctgcaattaa gcaacgcaat 1080
gagaatgcta aggcgactta tgaagctgca ctcaagcaat atgaggccga tttggcagcg 1140
gtgaaaaaag ctaatgccgc aaacgaagca gactatcaag ctaaattgac cgcctatcaa 1200
acagagctcg ctcgcgttca aaaagccaat gcggatgcta aagcggccta tgaagcagct 1260
gtagcagcaa ataatgccgc aaatgcagcg ctcacagctg aaaatactgc aattaaaaag 1320
cgcaatgcgg atgctaaagc tgattacgaa gcaaaacttg ctaagtatca agcagatctt 1380
gccaaatatc aaaaagattt agcagactat ccagttaagt taaaggcata cgaagatgaa 1440

caagcttcta ttaaagctgc actggcagag cttgaaaaac ataaaaatga agacggaaac 1500 

ttaacagaac catctgctca aaatttggtc tatgatcttg agccaaatgc gaacttatct 1560 

ttgacaacag atgggaagtt ccttaaggct tctgctgtgg atgatgcttt tagcaaaagc 1620 

acttcaaaag caaaatatga ccaaaaaatt cttcaattag atgatctaga tatcactaac 1680
ttagaacaat ctaatgatgt tgcttcttct atggagcttt atgggaattt tggtgataaa 1740
gctggctggt caacgacagt aagcaataac tcacaggtta aatggggatc ggtactttta 1800
gagcgcggtc aaagcgcaac agctacatac actaacctgc agaattctta ttacaatggt 1860
aaaaagattt ctaaaattgt ctacaagtat acagtggacc ctaagtccaa gtttcaaggt 1920
caaaaggttt ggttaggtat ttttaccgat ccaactttag gtgtttttgc ttccgcttat 1980
acaggtcaag ttgaaaaaaa cacttctatt tttattaaaa atgaattcac tttctatgac 2040
gaagatggaa aaccaattaa ttttgataat gcccttctat cagtagcttc tcttaaccgt 2100
gaaaataatt ctattgagat ggccaaagat tatacgggta aatttgtcaa aatctctgga 2160
tcatctatcg gtgaaaagaa tggcatgatt tatgctacag atactctcaa ctttaggcag 2220
ggtcaaggtg gtgctcgttg gaccatgtat accagagcta gcgaaccggg atctggctgg 2280
gatagttcag atgcgcctaa ctcttggtat ggtgctggtg ctatccgcat gtctggtcct 2340
aataacagtg tgactttggg tgctatctca tcaacacttg ttgtgcctgc tgatcctaca 2400

atggcaattg aaactggcaa aaaaccaaat atttggtatt ctttaaatgg taaaatccgt 2460 

gcggttaatg ttcctaaagt taccaaggaa aaacccacac cgccggttaa accaacagct 2520 

ccaactaaac caacttatga aacagaaaag ccattaaaac cggcaccagt agctccaaat 2580
tatgaaaagg agccaacacc gccgacaagg acaccggatc aagcagagcc aaacaaaccc 2640
acaccgccga cctatgaaac agaaaagccg ttggagccag cacctgttga gccaagctat 2700
gaagcagagc caacaccgcc gacaaggaca ccggatcagg cagagccaaa taaacccaca 2760
ccgccgacct atgaaacaga aaagccgttg gagccagcac ctgttgagcc aagctatgaa 2820
gcagagccaa cgccaccgac accaacacca gatcaaccag aaccaaacaa acctgttgag 2880
ccaacttatg aggttattcc aacaccgccg actgatcctg tttatcaaga tcttccaaca 2940
cctccatctg taccaactgt tcatttccat tactttaaac tagctgttca gccgcaggtt 3000
aacaaagaaa ttagaaacaa taacgatatt aatattgaca gaactttggt ggctaaacaa 3060
tctgttgtta agttccagct gaagacagca gatctccctg ctggacgtga tgaaacaact 3120
tcctttgtct tggtagatcc cctgccatct ggttatcaat ttaatcctga agctacaaaa 3180
gctgcaagcc ctggctttga tgtcacttat gataatgcaa ctaatacagt caccttcaag 3240
gcaactgcag caactttggc tacgtttaat gctgatttga ctaagtcagt ggcaacgatt 3300
tatccaacag tggtcggaca agttcttaat gatggcgcaa cttataagaa taatttcacg 3360
ctcacagtca atgatgctta tggcattaaa tccaatgttg ttcgggtgac aactcctggt 3420
aaaccaaatg atccagataa tccaaataat aattatatta aaccaactaa ggttaataaa 3480
aacgaaaatg gcgttgttat tgatggtaaa acagttcttg ccggttcaac gaattattat 3540

gagctaactt gggatttgga tcaatataaa aacgaccgct cttcagcaga taccattcaa 3600
aaaggatttt actatgtaga tgattatcca gaagaagcgc ttgaattgcg tcaggattta 3660
gtgaagatta cagatgctaa tggtaatgaa gttactggtg ttagtgtgga taattatact 3720
aatcttgaag cagcccctca agaaattaga gatgttcttt ctaaggcagg aattagacct 3780
aaaggtgctt tccaaatttt ccgtgccgat aatccaagag aattttatga tacttatgtc 3840
aaaactggaa ttgatttgaa gattgtatca ccaatggttg ttaaaaaaca aatgggacaa 3900
acaggcggca gttatgaaaa tcaagcttac caaattgact ttggtaatgg ttatgcatca 3960
aatatcatta tcaataatgt tcctaagatt aaccctaaga aagatgtgac cttaacactt 4020
gatccggctg atacaaataa tgttgatggt cagactattc cacttaatac agtctttaat 4080
taccgtttga ttggtggcat tatccctgca gatcactcag aagaactctt tgaatacaat 4140
ttttatgatg attatgatca aacaggagat cactatactg gtcagtataa agtttttgct 4200
aaggttgata tcacttttaa agacggttct attatcaagt caggtgctga gttaactcag 4260
tatacgacag cggaagttga taccgctaaa ggtgctatca caattaagtt caaggaagcc 4320
tttctgcgtt ctgtttcaat tgattcagcc ttccaagctg aaagttatat ccaaatgaaa 4380
cgtattgcgg ttggtacttt tgaaaatact tatattaata ctgtcaatgg ggtaacttac 4440
agttcaaata cagtgaaaac aactactcct gaggatccta cagaccctac tgatccgcaa 4500
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gatccatcat caccgcggac ttcaactgta attaactaca aacctcaatc aactgcttat 4560 
caaccaagct ctgttcaaga aacattacca aatacgggag taacaaacaa tgcttatatg 4620 
cctttacttg gtattattgg cttagttact agttttagtt tgcttggctt aaaggctaag 4680
aaagattga 4689
<212> Type : DNA
<211> Length : 4689

SequenceName : SEQ ID 594
SequenceDescription :

Sequence

<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString :

atgttaacgg aattaaaagc ggttttaaaa aagcctatgc tttggattac gatggtagga 60
gtagcccttg tccctgccct atataatatt atttttttga gttctatgtg ggatccttat 120
ggcaaagtat ctgatttacc tgttgcagtt gttaataaag ataaaacggc aacttatgaa 180
ggaaagaaga tgactatcgg taaagatatg actgataata tggtccgtaa taaaagtttg 240
gactatcatt ttgttgatag cgaaaaagct caaaagggac ttgaaaaagg tgattactat 300
atgatcatta ctttaccaga agatctttct caaaatgcgg ctagtgtttt aacagatgaa 360
cctaaaaagc taacgattcc ttaccaaacg tctaaaggac atagttttgt tgcctccaaa 420
atgagtgaaa ctgctgctaa gactttaaaa gagtctgtgt cgaaaaacat tacaagttct 480
tacaccaaat cactttttaa gaatatgtca accctaaaaa caggtcttgg cagtgcggct 540
aatgcaagtc aaaaaatagc gactggttca aaacagttag caaatggcag tcaagtgatg 600
actgataatc tgaatttact ttcaaattca agtcaatcat ttgctcaagg gactaatacc 660
ttatattcgg gcttaacagc ttatacggga ggtgtcggtc agctttctgc aggtttaaat 720

aatttaaaca atggtttgac agcttataca aatggagttg gtcagttagc aaatggcagc 780 

agccaactga gcaatcaatc tcagaagctt ctaggaggtg ttgcgcaatt agcgaatggt 840 

tctgcttcta ttcaacaatt ggttaatgct agcagtcagt tgaatcaagg acttattaag 900
ctgtcaacag caacaggtct ttctgaagaa caagttcaac agtttagctc attgattaat 960
cagttgggaa ctttgaatca aagtattcaa aattacagtg ataatgggac agcaacgact 1020
gcaaatagtc ctgatcttag tacgtattta tctgccatta caacagcagc tcaagcaatt 1080
gttaattcag gaaatacgtc tcagcaaaca acaactaatc agtccaatgc attggctgca 1140
gtccaagcta caggtgctta tcaaagatta tccgctgagg atcaatcaga gatcgctgca 1200
gctttggcca acactggtag ttcaacaaca actacaggtg ctgatgcaaa cgcagtgtca 1260
caagcccaag ctatcctaaa caatgtccaa agtattcaaa gtgccttatc tactttacag 1320
acaacaactg ctaatacacc aacaagccca tcagccagct tgactcaaat taaaaataca 1380
gctaattctg tattacctag tgcggcaact tccttgacaa ccttgtcaag tggtttgaca 1440
caagcaaaaa cagctcttga ttcgcaagta gtccctgtca gcacagccct tgctaatggt 1500
acggctcaat taggctctac tttttcaaca ggcgccaatt ccttgatgac aggagtaggc 1560
caatacacta atgcagttga tattcttaat gcaggagcta atactttggc tgctaaaaat 1620

aaccagttaa cagatggtac aagtcaattg gtaaatggtg ccaatcaatt aaatagcaac 1680 

tctggacaat taacgaaagg gactgcacag ttagcaaatg gtgctaatca aatagagaca 1740 

ggagctggca aattggctgc aggtggagaa agtttaacgg cgggcttgac aactttgtca 1800
agtggtagtg gtgaactgtc gaaagcattg tcaactgcta aaaataaatt atcattagta 1860
gcagttgaca atgataatgc taagactttg tcaagtcctg tcactatcaa gcatacagat 1920
aaagataatg tcaaaacaaa tggtgttgga atggctcctt atatgatgtc agcagcttta 1980
atggtaatgg caatttcaac caatactatc tttagagtag cactttctgg taagcaagct 2040
aaaaccctac gagaatggat agatcaaaag ttagcagtca atggcttgat tgctgttact 2100
ggagctatta ttctctattt tggtgttcat attattggtt tatcagctaa ttttgaacta 2160
aaaactttag gattaattat tcttaccagt atcactttta tggtcttagt gacaactttg 2220
gtaacttggc atgataaatt tggctctttt gctgccttaa ttttactttt actacaatta 2280
ggttccagtg caggaaccta tcccttagcc gttacagata agtttttcca agttgtcaat 2340
ccttatttac caatgagtta ttcggtttct ggtttacgag aaaccatttc tatggctggt 2400
acaattggta atcaactact agcgctgagt ttatttttcc ttacttttgc tgctttagga 2460
ttgttaatcg ctcgaaggag gattaggtct gtcaaagtag cctaa 2505
<212> Type : DNA
<211> Length : 2505 

SequenceName : SEQ ID 595 
SequenceDescription : 


Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 
atggtatccc aaaaaaataa atctaaaaag ggtcaatcta aaacgtttac cttaatttca 60
aatagaatta atctcctatt ttttttgatt gtcgctttgt ttactgtttt gcttttgagg 120

ttagctcaga tgcagcttta tgatgcaaag ttttacaaat ctaaattgac agagtcaaca 180 
acatatacta taaaaacctc cagccctcgc
ttagttgaaa acgaggttaa agaggttgtt 
aaggatatta aagcaaatgc aaaaaagtta 
gtaaccaaac gtcagaaaaa agattactat 
gttaaaaaat taccaaataa taaaaaatat
aagatctatg caaatgctgt taaagcagtt 
gagaaaaaaa teat teat at tttcagccag 
tcgttaacaa caggagattt aacagctgag 
gatttaaagg gaatttctgt taagactgac 

acttctatta teggtaaagt ttccagtcaa
aactatgtta aaaaaggtta ttctctgaat 
caatatgaaa aegatttgea aggtagtcgt 
ggtaaaatta tcagtgataa gaccactgcc 
acccttgatc ttgaatttca aaaaggtgtt 

gaattagcat ctggaaatac caagtattct
aacacaggtg cagttctttc tatggctggt 
tcttctaatg ctcttggagc ggttactgaa 
gcgactttga cagctggttg ggaaaacggt 
cagcctattc aatttgcggg ttcaagccct 

cctcttacag caagtcaatc cttagagtat
ttaaaattaa tgggacaaga ttatcatagt 
gaagctatgg agaaafctaag agctacttat 
attgaccttc egggagagtc aaagggctat 
ttaacagaat catttgggca gtttgataac 

gcagctgtcg caaatggtgg taaaegtatt
aataataaaa caggcggttt aggaaatctt 
aatgtttcta tetctagega tgacatgggg 
aacggtggta gttatgeaac agggaagact 
gcaaaaacag ggacagcega agettaegtg 

tccaatttaa acgttgttgc ttatgeacca
gtcttgccac atgagacaga ccttcatgga 
atcaatcttt atcaaaagat gtatccaatg 
<212> Type : DNA
<211> Length : 2079

SequenceName : SEQ ID 596

SequenceDescription :



ggacaaattt 
gectttacaa 
gcagatatgg 
ctggcagatc 
gataactttg 
ccaaacagtg 
atgaatgcaa 
caaattgeag 
tgggaacgta 
aaaaccggtc 
gaccgcgttg 
actgttcaag 
aaagggacta 
gaacaaattc
gaaggcgttt 
ttggaacacg 
gtctttactc 
gtcttatcag 
attaactctt 
tcttctaata 
ggtatgacct 
gctcaatatg 
acgccagaac 
tataeggcta 
gctccccatt 
gttcaatcta 
attatcaagg 
ettgeaaaag 
acaggagatg 
agcagcaatc 
accactagtc 
aatcagtga 



atgacgctaa aggggtggct 
gaagcaatac catgactgcc 
tgactttaac tgaatctaag 
caaaaaatta tcaaaaaatc 



gaaataacct 
ctattgatta 
cctctgtttt 
tattggctac 
aaacagataa 
tgcctgctga 
gaacatctta 
caatcaaggt 
aaggaaaaaa 
ttaaccaata 
atgctgttgt 
accttaaaac 
cgggttccgt 
gcaatcaagt 
ggtttaccaa 
cttatatggt 
tatcaacaga 
gtttgggtgt 
attatgatcc 
tgcagctggc 
tggttgaagg 
ttgacaccaa 
aaggtttcta 
gggcaagtgt 
atggtaaatc 
etcaaattge 
atgetattae 



aaeggaateg 
ttctgaagat 
taatacagct 
aagcaaatca 
aaattccatt 
agaagecaat 
tcttgaaaag 
taacaaagaa 
tctgaagttg 
ttttaattct 
tcttaacccg 
gggcgaagta 
tgttaaagga 
actcaatgac 
tggatcaact 
tcaattggct 
tggctataag 
ctcaacagga 
ttctaatgtt 
acaatatget 
tatctatgat 
ggttttaaat 
taatgttgtt 
tcccatttcg 
tgtttataca 
tgtagctgtt 
gagagatatc 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2079 



Sequence 

<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString :

atgacagttc taaaatatgg actaggtatt etcttaageg ctattatttt agecattata 60

attggaggtc ttctgtttac ctattatgtc agcagtactc ctaaactatc agaagctaaa 120

cttaaagcta ctaattctag tttggtttat gatagcaata ataatctgat tgctgattta 180

ggtgctgaaa agegegaaag tatttcttca gacagtattc caatgaagtt agtaaatgee 240

gttacctcta ttgaagatca ccgtttcttt aaacatcgtg gtgtcgacat ttategtatt 300

attggtgcag cttggagtaa tttacttcat aaatcaactc aagggggatc cactcttgat 360

cagcagctta tcaagctggc ctatttctct actaaagagt ctgatcagac ettaaaaegt 420

aaagctcaag aagtttggct gtctctacaa atggagaaaa aatacacgaa agaagaga 1 1 480

ctaacttttt atgtcaataa ggtttacatg ggtaatggga attaeggaat gcgcactgct 540

gcaaagtctt attatggcaa ggatcttaaa gacttatcaa ttgcccagct agcgacactc 600

gcaggtattc cgcaagcacc gacacaatat gatccttacg ctcagccaaa ggcagctaca 660

ageagaegta ataccgtttt gtcacagatg tataaacata aaaaaattac aaaacgagaa 720

tatgatgetg cagtagcaac accaatttct gatggcctgc aagaactgaa acgctcctct 780

agttatccaa aatatatgga taattatctg aaacaggtta tttcagaggt gaaaaaacgt 840

actggtcaag atatcttttc agcaggcatg aaggtttata caaatgttaa tgecgatgea 900

cagcaatatc tctggaacat ttataataca gatgaatata ttgettatec tgatgataat 960

ttccaagttg cttctactgt tatggatgtt actaatggta aagttattgc acagcttggc 1020

ggacgccatc aagataccaa tgtttctttt ggtaccaatc aggctgtctt aactgatcgt 1080

gactggggat caaccatgaa acctatttca geatatggee ctgctcttga aagegaaget 1140

tttacgacaa ctgeacagat gctaaatgac teggtctatt attatccagg tacaacaaca 1200

caagtctatg actgggatca tegttataat ggttggatga ctatccaaac ggctatccaa 1260

caatctegta atgtccctgc tgtcagagct attgatgecg ctggattaga tactgecaaa 1320

ggtttcttaa geggtctegg tattgattat ectgagatge gttattcaaa cgccatttca 1380

agtaatacaa gtagttcaga acaaaagtat ggtgccagca gtgaaaaaat ggccgccgct 1440

tatgetgett tttctaatgg tggaacttat tatgaaccac aatacgtcaa taaaatagaa 1500

tttaaggatg gaacatcaga gacctatgat gctaaaggca ategtgegat gaaagaaacg 1560
acagcctaca tgatgacaga tatgttaaaa acagtattaa catatggtac tggtactgag 1620

gctgctattc ctggtcttta tcaagcaggt aaaacaggaa catccaacta tgatgacaat 1680

gaattggtag agatgtctga aaaacttggt atfcaatcctt atggacttgg tactattgct 1740 

ccagatgaaa actttgttgg ttatacacct cagtattcaa tggctgtttg gacaggatat 1800 

aaaaatcgct taatgcctgt ttacggagac agtatgaaaa ttgctgcgca agtctatcgt 1860

actatgatgg cttatctttc tagctcaggt aattctgatt ggaccatgcc tgacggtctc 1920

tatcgcagcg gtggttatct ttacctaaat ggttcaagtg ggtcaaatag taggtatggt 1980 

gcagctcctg caacttcatc gtcatcttct tcatcatctt cttctgattc aaacaataac 2040

gatcaaaata ataatcaaac tacagaagcg tctagtgact catcttcatc aagttctgat 2100 

gctacgacat cttctaatcc ataa 2124



<212> Type : DNA 
<211> Length : 2124 

SequenceName : SEQ ID 597 

SequenceDescription :

Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 

atgaaatcga aaactgctaa aattactttg ctaagcagcc ttgctttggc ggcttttgga 60

gcaacgaatg tttttgcaga tgaagcatca actcaattaa attctgatac tgttgcagca 120 

cctactgctg atacacaagc atcagaaccg gctgcaacag aaaaagaaca gtctcctgtt 180 

gtagctgttg tcgaaagtca cacacaagga aatacaacaa cgacaacatc tcaagttact 240 

tctaaagaat tggaagatgc taaggctaat gctaatcagg aaggtttaga agtcactgaa 300

actgaagctc aaaaacagcc ttcggtagaa gctgcagatg cagataacaa agcacaggca 360

caaacaatta atacagcggt agctgattat caaaaggcaa aagctgaatt tcctcaaaaa 420

caagaacaat ataataaaga ttttgaaaag tatcagtctg atgtcaagga gtatgaagct 480 

caaaaggcag cttacgagca atataaaaaa gaagttgcac aaggtttggc atctgggcgt 540 

gttgaaaaag cccaaggact tgtgtttatt aatgaacctg aggcaaaact ttctattgag 600 

ggtgttaatc agtacctaac aaaagaagca cgtcaaaaac atgcaactga agatattctt 660

cagcaatata atactgataa ttatacagct tctgatttta cccaagcaaa tccatatgat 720 

ccaaaagaag atacttggtt caaaatgaaa gtgggagatc agatttcagt tacctacgat 780

aatatcgtta attcaaaata taatgataaa aagattagta aggtaaagat taattatact 840 

ctcaatagtt caacgaataa tgaaggcagt gcactggtca atttgttcca tgatccaact 900 

aagacaattt tcattggtgc acagacatct aatgctggca gaaatgataa aatcagtgtg 960

acgatgcaaa ttatttttta tgatgaaaat ggcaatgaaa tcgatttaag cggcaataat 1020 

gccattatga gtctctcatc gttaaaccat tggacgacta agtatggcga tcatgtggaa 1080

aaagtaaacc ttggggataa tgaattcgtt aaaataccgg gctcatctgt tgacttacat 1140

ggcaatgaaa tctattcggc taaggacaac caatataaag ctaatggtgc aacctttaat 1200

ggtgatggag cagatggctg ggatgctgtc aatgctgatg gaacgccacg tgctgcgacg 1260

gcttattatg gtgcaggtgc tatgacttac aagggagaac ccttcacctt tactgttggt 1320 

ggtaatgatc aaaacttacc aacaaccatt tggtttgcga ctaattcagc tgtagctgtg 1380

cctaaagatc cgggagctaa accaacaccg ccagaaaaac cagagttgaa aaaacctact 1440

gtgacttggc ataaaaatct tgttgttgaa actaaaactg aggaagttcc tccagtgaca 1500 

ccaccaacaa ctcctgatga accaacgcca gaaaagccaa aaacaccaga ggatcctcaa 1560

tcacctgtcg tagctaagtc agtaagcttt agaacggcaa gaaaaggaga aatgcgtgtt 1620 

agagagcgtg attatcaacc gactcttcca catgctgggg ctgctaaaca aaatggttta 1680 

gctactcttg gtgctatttc aactgcattt gctgcggcta ctttgattgc agctagaaaa 1740

aaagaaaact ag 1752

<212> Type : DNA

<211> Length : 1752 

SequenceName : SEQ ID 598 
SequenceDescription :



Sequence



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString :

atggaacaga agatttttag caaacgaaaa agtaagattg ctgggctttg tggagctatt 60 

ttaacgacta cagttgttgc ccttgcgtca ggtactgtaa tcgaggctga tgagacaata 120

gaacagcctg tcgcagctga gactgtctcg caagctgatg gggacaatcc cgaacaaaca 180 

acaagcgttc aacaagaaac tgctcctcaa caaacgaaaa cttctcaaag ctcagacgca 240 

accgtagata gtgaagagtc agcaacttcc ccatctgatg aacagaccgt aagtcaaaat 300

gattcaaact catcatctca aattgatcaa acgatagctg atacgaatcg ctctgactct 360

gatcatattt caaaaacatc agccgctaca actgaagatc aagaagagaa agttaattct 420

gcaaaagcac aaactgctgc cgcaaccaac aatcaagaca ctcgttatag tgcgaaagat 480 

gcttatggca attccaattt taacaagaca ttaactgaat ttggaaaaaa tgctaatgtt 540 
gctgatgtaa cctataatgg cgtgagggat gaatatattg tagttaacga tcctagtgct 600

ccttacgttc ctaatgcaaa cgaaattgca aaatacttaa aggaatattt aacagaactc 660 

cgcaacatca ataatattgc tattcctgtg ccttctgttg atcaggttat gcaaaaatac 720

gcacaagatc gggctaacga agaagccaat gaaaaaaacg gcttggatca tgatactaat 780

ttacctatcc ctaataattt aacttgggtt gccgaagatg gacatttgga tatggatagc 840

agcattcaat ccaaaagtca agaaggctat acacttgctt ctgataaagc aaccgcctac 900

tatctagcgc ttaactggtt ttctgactat tttaatattt acgatgaccc caacgatggc 960 

ctcaaatcgt ttggacacgc tgtcagtatt ttgtcagacg ggggaactgg aatgggctta 1020 

ggtcttgctt caggtcaaga taatgaaaag ggaatgtggt acgcacaatt ggaatttggt 1080

ggtaacgata acgaagataa taccaacgat ttttcatctt taaaaaacgg caagggagaa 1140

tgggtattat attataaagg aagtcctgtt aagtttcttc ctaacactac cttttggtat 1200

gtaaaaaaag gcacttcccc tgatgcagct tctactcctc acaacagtga taaaccttca 1260

ttccagtcat ctaaagatct tgaccctaat ttcaaggccg ataatagatt ccaagaagga 1320 

aaggaagcct ctgttcatca ggctattcct gcaacattta aatctcatcg cgatgaagtt 1380

ggtaataaag accaaaattc tctttctgct caactacctg atacaggagt tcaaaaaaat 1440

aatcaattag ccttgatagc tttaggaaca ggcttgattt tactttccgg acttcttctt 1500

tcaaaaagaa aatccttaaa ataa 1524 
<212> Type : DNA 
<211> Length : 1524 

SequenceName : SEQ ID 599

SequenceDescription :



Sequence 



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString :

atgacatttg aaaagcaaaa acactttagt ttacgtaaac taaaatttgg tttagtttca 60 

gttgcgatca tagctttctt atttgctgta acaaagactg cagaagctga cgagacagtg 12 0 

ataactgaac aaaggcaaac aagtaagatt aacgctagtt ctcaaaaagt ggagaatcag 180 

30 acttcaaatc aggtggaagc aaaaacggat agtgcaaaca aggatcctca agaaaaaaca 240 

ggaagtgttg caactgatgc cccttcaatg aattcagcta ataatatgag tcagtctgac 3 00 

aaacaaaata ctgttaatga aatatcttca gatagtcagc aaacaaaaac agatgaacaa 3 60 

actgatttac cgcaaaacag ctttaaacaa caatctgctc atgttaaaat gactactgaa 420 

gcagagaaga ccccatcaca ttcgattaat acctttgtta atgatggtaa tggtaattgg 480 

35 tattaccttg gtgctgatgg tagaaatgtt acaggcagtc atacgattgg tggcaagact 540 

atgtattttg ctcaagatgg taagcaagtt aaaggtgctt ttgctcaaga ttcggatgga 600 

aataaacatt attatgatag agatagtggt gagatgtgga ccaatcgctt tgtcaatgat 6 60 

caaggcaatt ggtattatct taataatgat ggtgtccctg tcaccggtag tattactgtc 720 

aatggtcagt ctctgtattt taattcggat ggtagtcagg ttaaagggaa ttttgttgaa 78 0 

40 gaagatggat ctttgcgtta ttatgataaa aattctggag atttactgag aaagacaagc 840 

cgaaccatta atggtgttaa ctaccaattt gataatgatg gaaatgcaag ggcgattgac 90 0 

aaaattgagg ttgttaagac cagtcttgta gttgatagtt atgaatttgg tccttctgtt 960 

tcaaagatta ttcttgagtt caatcataag gtaactcctg ctgttgttca tgctggtgca 1020 

atggtaacaa ctgctggagt tcaaagaaaa attcttaatt cttatgtctc taatgcttca 1080 

45 ggacatgtgg tttactttga tagtagccat tatgtgacac ttgaattaga tattccttat 114 0 

gatccaaatg atagcagccg aaatgcatca ccttttattt ttgactcagc agcctttcgt 1200 

aataactggg tcaacagtta tactgtcaaa gtagataatt tgcaggtgca agcagatggt 12 60 

tctaatagca gtcaaattat cagttcagag caagatgcta tcaataatcg tttcttgcct 132 0 

acaacggatc gtttctcaga acgtggtagt tatggtaatt ttaattatgc cgcctatcaa 1380 

50 ccagaagcag ctattggcgg tgagaagaat ccattgattg tctggttgca tggtatagga 144 0 

gaagtaggca ctgatattaa tattccgctt ctagccagca atgtggctcg tttaacggaa 1500 

gatcctattc aaagccattt cacttctaca ggtagtggtg gtcaaaaggg agcctatgtg 1560 

ctagttcctc aaagttcaat tccttggtct caaaatcaaa cagctagctt aatggcgctc 162 0 

attaaagcct atgtagcaag ccatccagac atcgatagcc gacgcattta tttggcaggt 168 0 

55 gtttctaatg gtggcggtat gactctggat atgggagtcg cttatcctaa ctattttgca 1740 

gccttagttc ctattgctgc ttcctatagt aatcaattaa cagataatca gattaccgct 1800 

gctgctttga aagctctgaa aggtcaacca atgtggttga ttcatacacg aactgataaa 1860 

acaatatctg cagatagtag tgttctacca ttctataaag agttacttca agctggcgca 1920 

caaaataaat ggctttccta ttatgaaact aatgttggta aacatcactc tggagtcact 198 0 

60 tataacggtc actggtcttg gatttatttc ttgaatgatc aagtaactgg cactcaaaat 2 040 

actgataacg ccaagaattg gtctggactt tctggcatgg ttgcgaccaa tccaacctat 2100 

ggtggtgatg ctaaggctac tgtcaatggc agaacttata gtaatgtctt tgattggcta 2160 

aatggtcagc gaagaaggta a 2181 
<212> Type : DNA 

<211> Length : 2181

SequenceName : SEQ ID 600 
SequenceDescription : 
Sequence

<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString :

atgaaaattt ttataaaaaa acaccaacaa agtattcttt actatagtct tagttttctg 60 

ctaccaagtt ttataatgtt tctcgttcta ttctccaaaa atatttattg ggggagtagc 120 

acaactattt tagctagtga tggttttcat caatatgtga tttttgatgc tctttttcgt 180 

aatattctcc atggaacgga tagtttgttt tactctttta aggctgggct tggttttaat 240 

10 atttttgctc tgacaagtta ttacttggga agttttttaa caccttttac ttactttttt 300 

aatgtaaaaa atatggcaga tgctttttat ctcttcactt taatcaaatt tggtctaata 3 60 

ggtttatctg ctttttacag tcttgggcaa atttatacta aaatctctaa atcactcgtt 420 

ttgatgctgt caacatctta tgctttaatg agctttacta gcagtcagtt agaattaaac 4 80 

aattggttag atgtttttat cctgctacca cttattatgc ttggtttaca gcgtttagta 540 

15 gaaaaaaggg ggatttttct ttattttcta actcttactt gtttatttat tcaaaattac 600 

tattttggtt tcatgacagc tattttctta actctttggt tttttacgca agtctcgtgg 660 

gatattagaa acagaatgaa acgattaagt gattttgtgc tcgtatcaat ctttgcaacg 720 

ctgacaagtg cttttatgct gcttccaaca tttcttgatt taaagagcca tggtgaagta 7 80 

ttaacagaac aaattagtct attttcatca gacatttggt atttcgattt ttttgctaaa 840 

20 agtcttcttg gtagttatga tacgacaaaa tatggctcta ttccaacgat ttatatcggt 900 

ttacttccct tgatttttgc cattactttt ttctttgtta aatctataaa atggcaagtt 960 

aaagtagctt attttctttt attggctatt attattgcaa gttttatctt tcaaccactt 1020 

gatttatttt ggcaaggaat gcattcacct aatatgtttt tgcatcgtta ttcttgggct 1080 

ttctccttag ttattgtcat aatggcagct gaaacgttaa ctcggataaa ggatataaaa 1140 

25 ttgaaaaatt tttatccagc ctttaccttc ttgggagtag gacttttagc aactttttta 12 00 

ttcaaggact attataatta tctgacacaa gttaatttta tattaacaac tatcttttta 1260 

gttagttatt ttattattct ttttactttt tttaatcaat tagtttctta taaagttatt 1320 

atttccttta cacttatctt tacaagtttt gaaatagctt taaatacttt ttatcaaatt 13 80 

gaaggtattc aaactgactg gaatttccct tcaagagagg tttatgaaga taatgtaaag 1440 

30 gaaattgaca actatgttaa gaaaactaaa aaagataact tagaattttt tcgaacagaa 1500 

aaacaaattc cccaaactta caatgatggt atgaaattta attataatag catttctcag 1560 

ttctcatctg tcaaaaataa cttatcagca caattattga attctctagg ctactattca 1620 

caaggaaatc attctaccat tagttatcct aataatacta ttttgatgga tagtcttttt 1680 

tcaattaaat acaatattaa taatcaaaat cctcataaat ttggattcca tttaaaacag 1740 

35 aaaaacaata agctgcaact ttacaaaaac ttctattctc ttcctttagc acttatgtca 18 00 

aatcatattt acaaagatgt caagtttgac tcttatcccc ttgataatca acaaaaattt 1860 

gttaatgaat tgacagatct aaatcttaca cttttcaaag aaatccctat tatttcaagt 1920 

gtcggaatgc aagttttaga taatcgtgtt actattaatg gttcaaaagg aaataaggca 1980 

caagtttact atactgtaaa gtgtcctgca aatagtcaac tttatatcag ccttcctaac 2 040 

40 ttgacagtta ataataaaga cgaaaatgtc tttataacaa ctaacaagca cacaagttct 2100 

tatatcatag acgaaagtta ttatcttttt aatttaggaa attataaaaa aactcaaaca 2160 

ttaatattta agcttagttt tccaaaaaat aaaacggtta gttatgattt accacatatt 2220 

tatgctctcg atttaactgc ctatcaaaaa agtataaagc aattaaaaag tcaaactgtt 2280 

aaaacaacaa ctaagaaaaa taaaattttt actacctatg ttgccaaaaa gagaacttcc 2340 

45 ttgatttaca ctttaccata tgataaaggt tggtttgcta aacaaaatgg aaaagcaatt 2400 

aaaatatcta aagcacaaaa tggactaatg aaaattgatg tttctaaagg tagtgggaag 2460 

attataatga cttttgtgcc ccaaggacta tatcaaggaa ttcttcttac ctgtctaggt 2520 

atctttctct ttgtatttta ccaactttat tacaaaaaat ttaatttaaa ataa 2574 

<212> Type : DNA

<211> Length : 2574 

SequenceName : SEQ ID 601
SequenceDescription : 

Sequence



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString :

atgaaattga aacatatttt aagaattgga gcggttgctt ttgcctcaat tcttttgtta 60 

60 actgcttgcg gatcaaaaac atctaaaaaa acagtaaccc ttgcgactgt tggaacaaca 12 0 

aatccatttt cttatgagaa aaagggaaaa ttgacgggat atgatatcga agttgctaag 18 0 

gaagttttca aagcttctga taaatacgat gtcaaatatc aaaaaacaga gtggaccagc 240 

attttctctg gtctagatag tgacaaatat caaatcggag ctaacaatat cagttatact 3 00 

aaagagcgtg ccaataaata tctttattct aatccaacgg cttccaatcc attggtatta 3 60 

65 gtggttccaa aagatagtga tattaagtct tataacgata ttgctgggca tagcactcaa 42 0 

gttgttcaag gaaatacaac agtgtctatg ctgcagaaat tcaataaaaa ccatgaaaac 480 

aatcaagtta aactaaactt taccagtgaa gatcttgcgc atcaaatccg gaatgtcagt 540 
gatggtaagt atgattttaa aatttttgaa aaaatttcag cagaaacgat catcaaagag 600 
caaggacttg ataatttgaa agttattgat cttccttcag accaaaaacc atatgtttac 660 
tttatttttg cgcaagacca aaaagactta caaaagtttg tcaataaacg tctcaaaaaa 720 
ctttacgaga atggtacact tgaaaaatta tcgaaaaaat accttggagg aagctatctt 780 
ccagataaaa aagatatgaa ataa 804 
<212> Type : DNA 
<211> Length : 804 

SequenceName : SEQ ID 602 

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString :

atgcgatttc ttgtctttct catcgcattt tttgctgctt tctataaatt tatcgagact 60 

gaacggattg attcaaatac agttgctgta aaccctgatt cgctcatttt aaagcgattt 120 

ttaaaaacaa atcaattaaa tgggatcatg attgtgacgg ggccagatgg taaggctcaa 18 0 

gtattttcaa atcaaagcaa ggtagatggc agtcctgttt caattaagga ttattttcct 240 

cttgcttctt tacaaaaatt gataacaggg gtggctatcc aacaattaat tgataaagga 3 00 

aaactgtctt taaacacacc tttaagcaaa tattatcctc aaattgaaaa tagtgaaaat 360

atcacgatac aaaatttact tacccacaca agcggtttgg cagatcgaaa agaagttcct 42 0 

cagcaagtgc tgacaactca agagcagcaa ttggattttt cattgaccaa ttatcgcgta 4 80 

acttatcgaa aaaaatggaa gtatgctaac attaattatg ctttgctagc tggcattatc 54 0 

agtcaaatta gcggtcaaaa ttatgcgact tatgttcgtc aacacttctt aacagctggt 600 

aaggggtggc attttaaaaa gtatattcaa ataaaagata agtccaagtt agctgccttg 660 

tcagtgatgg atcaaagtac gacttgggat aagctgtcaa aagaagtgac atctaccttt 72 0 

ggagctggtg attatgcttc taggccagtg gattattgga aatttatgat ggcttttatt 780 

aatgaccaat ttgttcctgt cagcgaatac caacgttcta tgaaaatgac ttctaagagc 840 

tattatggcg gcctctatat cagccaaaag atgctgcatg caaatggtgg tggctttgat 9 00 

acttactctt gttttgctta ttcaaatcct aaaaccaaac aggtcatggt tttgtttatc 960 

acaaacggta agtataaacg ggtcaaatcc ttagcagcta aagcctttaa actatatgca 1020 

gattcgtatg cgctgaggaa aaatgaaacg tcaaaataa 1059 
<212> Type : DNA 
<211> Length : 1059 
SequenceName : SEQ ID 603

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString : 

atgaagaaaa aaatagctct agcagctctt tcttttgtca gtgcagctgt tcttgcagct 60 

tgcagctcag cacctggtgg ttcatcagat gcagctggta ataaaattgg agatactgta 12 0 

aaaattggtt acaatcttga attatcagga gatgtagccg cttatggaca agctgaaaag 180 

aacggtgcta accttgctgt tgaagagatt aataaggcag gcggcattga tggcaaaaag 240 

attaaagtta tctcaaaaga taataaatct gataacggtg aagcatcaac aatctcaact 3 00 

aatcttgcta cccaaagtaa agtaaatgct atcttgggac cagcaacatc tggtgctaca 3 60 

gcggctgctg ctcccaatgc caacgatgct gcagtaccac tcgtaacgcc ttctggaaca 42 0 

caagataatt tgacctattc aaaaggcaaa gttcaagatt acatcttccg tacaactttt 480 

caagatagct tccaaggaaa gatcattgcc aaatatgcaa cagataattt gaaagctaaa 540 

aaagtagcgc tttactatga taagtcaagt gattacgccc aaggtattgc tgatgcattc 600 

aaaaaagcat ataaagggaa gattactgtt gaagatacct ttcaagctaa agaccaagat 660 

ttccaagcag ctctgaccaa gtttaaaaat aaagactttg atgccattgt gataccaggt 720 

tattatactg aaactggtct gattacaaag caagcacgtg atatggggct tacccagcct 780 

atcttaggac ctgatggttt taatgatgaa aaatatgttg aaggtgctgg tgcagccaat 840

accaataatg ttcattatgt atctggttac tcaacaaaag ttgctttaac aaataaggct 900 

gaaaaattcc tgaaagatta taaggctaag tatggtgaag agccaaatat gtttgccgct 960 

cttgcttatg attccgttta tatgattgct gatgctgcaa aagatgccaa aacatctaag 1020 

gatattgcaa caaacctagc taaattgaaa aactttaaag gtgtgacagg taaaatgaca 1080 

attgataaga aacataaccc tgttaaatca gccgttatgg ttggtcttaa agatggtaaa 1140

gaagacacag ctactgctgt tgaagcaaaa taa 1173 
<212> Type : DNA
<211> Length : 1173 

SequenceName : SEQ ID 604 
SequenceDescription :



Sequence 
<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString :

atgaagaaat taagcttatt attactagtt tgtttatctt tattaggctt atttgcctgt 60 

5 acttctaaaa aaacagccga caaaaaattg actgttgtgg ctaccaattc tattattgct 120 

gatattacta agaatatcgc tggtaataag gttgtcttac atagtatcgt tcctgttggt 180 

cgagatcctc acgaatatga gcctcttcct gaagatgtta aaaagacctc tcaggctgat 240 

gtcatttttt ataatgggat taatcttgaa aatggaggca atgcttggtt taccaaacta 3 00 

gttaaaaatg ctcataaaaa gacagacaag gattattttg cagtgagcga tagtgttaag 3 60 

10 accatttatt tggaaaatgc aaaagaaaaa ggaaaggaag atcctcatgc ttggcttgac 420 

cttaaaaatg gtattattta tgctaaaaat atcatgaaac gtctatctga aaaagatcct 480 

aaaaacaaga gttattatca gaaaaatttt caagcctaca gcgccaaact tgaaaaacta 540 

cacaaagtag ccaaagaaaa aatcagtcgt atccctactg agaagaaaat gatcgtaact 60 0 

agcgaaggtt gtttcaagta tttctctaag gcttacgata ttccttctgc ctatatatgg 660

15 gaaattaata ccgaagaaga gggaacacca aatcaaatta aggctttagt gaaaaaatta 720 

aggaaaagtc gggtgtctgc gctttttgta gaaagcagtg ttgatgatcg tccaatgaaa 780 

actgtttcaa aagatacagg tatcccaatt gccgctaaaa tttttacaga ttcagttgct 84 0 

aaaaaaggac aggctggaga tagttactat gcgatgatga agtggaatat agataaaatt 900 

gcaaatggtc tgtcacaatg a 921 

<212> Type : DNA

<211> Length : 921 

SequenceName : SEQ ID 605 
SequenceDescription : 

Sequence



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString : 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 

atgtttgttc atactaagac taagaaaaaa agaaagtggc aaaggaaagt gtttctactg 60 

30 ctgcttcttt ttttattgcc tattgtgtca gtattggctt ttattgtgtt atttattggc 120 

ggtggtacag ctgagtctca tgatgtggaa gcgacgacag ggggcgttaa gctttcagct 180 

aagcaatttg cagataagac aaagttagga atttcagaag aggaagctaa aaatgcctta 240 

gcttttgcgg ataggttgat gtctcgtcat cattttacag ctcaagcaac tgctggagta 3 00 

ttggctgttg gctttcgtga aagtggcttt gatgtcaaag cagttaataa ttctggtggt 3 60 

35 gtagctggct ttttccaatg gtctggctgg ggtagttctg ttaatggtga tcgttggaaa 420 

gtagctagta aaagagagtt aactctagag gttgaggtag atttgatgag cactgaacta 480 

gatggtcgat atgctgatgt tgtcaaaaaa gttggttctg cgactgatga aaaacaggct 540 

gctaaggatt ggtctcagta ttatgaaggt gttgcggtta gtgatggtca aacgaaagct 600 

gataaaattg agagttgggc aacaactatt tgtgaggctt taaagtctgg tggtacaaat 660 

40 tatgctaaag tgaataatac gggaacaagt tctactgcta tcccgcaggg ttgggaaaat 720 

attagtgctt ttgatggcca tgcttatgaa ggtagtgaaa attatcctca aggacaatgc 780 

acttggtatg tttataatcg tgctaaacag ttgggtgtta gcttcagtcc ttatatgggg 840 

aatggcggtc agtggtatca agtgcaaggc taccattcta gtcatacacc taaagcacat 900 

acggctttat cttttgtcaa tggtcaggca ggttctgatc caacttatgg tcatgttgct 960 

45 tttgtagagg ctgttaaaga tgatgggagt attctaatca gtgagatgaa cgtttatggt 1020 

caaccagcta tgacggttgc ctatcggaca tttgatgctg aaactgctaa acaattttgg 1080 

tatgtagagg gaaaataa 1098 
<212> Type : DNA 
<211> Length : 1098 

SequenceName : SEQ ID 606

SequenceDescription : 

Sequence 



ttactatttt attgggagct 60

attatacaaa tacggcttct 120

ctaaggctgt taattattgg 180

aagtcagtat taaggctggg 240

cgactgctct aactttcgat 300

ctgatcctaa tacaggagaa 360

atgataaggc tatgacatta 420

ctggtgttgt aaactttaac 480

atattacgcc aataagtcca 540

gtttgattca ctggaaagtt 600

tgactgatgt tgtcggagaa 660

gatcaagaaa tcgtaaaaga ttccttggtt gctgcacgct tgcagtacat tgctggtgat 720

gatgttgaca gtttagatga agctgcttcg cgaccttatg ctgaggattt ttcaaaaaat 780 

gttacttatc aaactaatga tttaggattg acaacaggat ttacctatac aattccagga 84 0 

tccagtaaca acgctatctt tatctcttat actactcgtt taacttcfctc tcaatctgct 9 00 

5 ggtaaagatg tcagcaacac tattgctatt tcaggaaata atattaatta ttccaatcaa 960 

acaggctacg ctcgtattga atccgcatat ggtagagcta gttctagagt aaagaggcaa 102 0 

gcagaaacaa caactgttac tgaaacaaca acttcgtcat cttctgaaac gacaactagt 1080 

gaagcgacaa cagaaacaag tagtacaaca aataataatt caactactac agaaacagct 1140 

actagcacaa caggagcttc aacaacacaa acaaaaacga ctgcttctca aacgaatgtt 12 00 

10 ccgacaacaa caaacataac aacaacttca aaacaagtaa ccaagcaaaa agcgaaattt 12 60 

gttttaccat caacaggtga acaagcaggg cttttgttaa ctactgtagg tcttgtaatt 1320 

gttgctgtgg caggtgtcta tttctataga acacgtcgtt aa 1362 
<212> Type : DNA 
<211> Length : 1362

SequenceName : SEQ ID 607

SequenceDescription :



Sequence 



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString :

atgacattta aaaagttagt tttaggtttg ttgagttttg tggctgtatt tactttagta 60 

gcttgcagtt cttctaattc aaaaaattta caggatgata ttaaagaaaa gaaaaagtta 12 0 

gttgttgctg ttagtccgga ctatgctcct tttgagttca aggctcttgt gaacggtaag 180 

25 gatactgttg ttggtgctga tattgatttg gcaaaagcaa ttgctaaaga attgggagtg 240 

aaactggaat tatcttccat gagttttgat aatgtcttgt ccagtttaaa aacaggaaaa 300 

gcagacatag ctatctctgg tttatcttat accaaggaac gtgctcaagc ctatgacttt 3 60 

tcagaagctt attataaaac ggaaaatgct attcttatta aaaagtctga tttgaacaaa 420 

tatacaatga tttcttcttt taataataag actaaagtag ctgttcaaaa aggaacgatt 480 

30 gaagaaggat tagctaaaaa tcaattaaaa caatcaaaca ttacctcttt gacttcgatg 540 

ggcgaagctg ttaatgagct caaatctggt caggttgatg ctattgatct tgaaaaacca 600 

gtggcagaag gttatgtgtc tcaaaatagt gatttggttc ttgccaaagt tgccttaaaa 660 

acgggtgaag gggatgccaa agcagttgct ctgcctaaag acagtggtca attagttaag 72 0 

acggtgaata aggttattaa gaaactcaaa aaagaagata aatacaagca gtttatcagc 780 

35 gatgctgtta aattaactgg tcagcaagtg gattga 816 



<212> Type : DNA 

<211> Length : 816 

SequenceName : SEQ ID 608 
SequenceDescription :


Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString :

45 atgaaaaagc attttttcat gacttttagc ctcttgctag cggctgtttt tctagttgct 60 

tgttccaatc tttccgattc tggacagagg aattgggata agataaataa gagaggaatg 12 0 

cttaaaattg ctactgcagg aacgctttat ccgcaatctt atcatgatga tcataataaa 180 

ttgacgggtt atgatgttga aattctaaaa gaaataggaa aacgtttggg attgaaagtt 240 

cagtttactg aaatgggtgt cgatggtatg ctgacagcca tcaagagcgg tcagatcgat 3 00 

50 gttgctaatt attccctaga agacggcaac aaaaatatca gtaagttttt gagaacctct 360 

ccctataaat attcttttac gtcaatggtt gtccgctcta aagatgattc aggtattcat 420 

tcttggtcag accttaaggg aaaaaaagct gccggagctg ccagcactaa ttatatgaag 48 0 

attgctaaaa aattaggagc aaaattagtt gtctatgata atgtcaccaa cgatgtttat 540 

atgaaagatt tagttaatgg tcgtacagat gtcattatca atgattatta tctgcaaaag 600 

55 atagctgttg cagcagtcaa agacaaatac gctatcaaaa taaaccaagg actttatgcc 660 

aatccttaca gcactagttt tacattgtct ttgaaaaaca aagtactgca aaagaaaatc 720 

aataaggctg tgaaagacat gcgcaaggat ggcaccctaa ccaagctatc taagaagttt 780 

ttccaaggag aagacgtcac taaaaaacat tataatagct ataaaaaaat tgatatttct 840 

gacgttgatt aa 852 

<212> Type : DNA

<211> Length : 852 

SequenceName : SEQ ID 609 
SequenceDescription : 



Sequence



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString :

atgaagcttt tgaaaaaaat gatgcaagtc gcactagcca catttttctt cggtttgcta 60 

gggaccagta cagtatttgc agatgattct gaaggatggc agtttgtcca agaaaatggt 12 0 

agaacctact acaaaaaggg ggctctaaaa gaaacctact ggagagtgat agatgggaag 18 0 

5 tactattatt ttgatccttt atccggagag atggttgtcg gctggcaata tatacctgct 24 0 

ccacacaagg gggttacgat tggtccttct ccaagaatag agattgctct tagaccagat 3 00 

tggttttatt ttggtcaaga tggtgtctta caagaatttg ttggcaagca agttttagaa 36 0 

gcaaaaactg ctacgaatac caacaaacat catggggaag aatatgatag ccaagcagag 42 0 

aaacgagtct attattttga agatcagcgt agttatcata ctttaaaaac tggttggatt 48 0 

10 tatgaagagg gttattggta ttatttacag aaggatggtg gctttgattc tcgcatcaac 54 0 

agattgacgg ttggagagct agcacgtggt tgggttaagg attaccctct tacgtatgat 60 0 

gaagagaagc taaaagcagc tccatggtac tatctagatc cagcaactgg ctggcaaaac 660 

cttgggaaca aatggtacta tctccgttca tcaggagcta tggcaactgg ttggtatcag 72 0 

gaaggttcga cttggtacta tctaaatgca agtaatggag atatgaaaac aggctggttc 780

15 caagtcaatg gtaactggta ctatgcctat gattcaggtg ctttagctgt taataccaca 840 

Staggtggtt actacttaaa ctataatggt gaatgggtta agtaa 885 
<212> Type : DNA 
<211> Length : 885 

SequenceName : SEQ ID 610 



SequenceDescription :

Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString :

atgaaacttt tgaaaaaaat gatgcaagtt ctactagcag tctttttctt tggtttgcta 60 

gctacaaata cggtatttgc gaataccaca ggtggccgat ttgttgataa ggataataga 120 

aaatattatg taaaagatga tcataaagca atctattggc ataaaataga cggtaaaact 180 

tactattttg gtgatattgg agagatggtt gtcggttggc aatacttaga aattcctgga 240 

30 acaggttatc gtgataattt attcgataac caaccagtta atgaaattgg ccttcaggag 300 

aagtggtact attttggaca agatggtgct ttgctagaac aaacagataa acaagtacta 3 60 

gaggcaaaaa cgtctgaaaa tacaggaaaa gtatacggtg aacaatatcc tctatctgct 420 

gaaaagagaa cttattattt tgataataat tatgctgtaa agacaggctg gatttatgaa 48 0 

gacggcaatt ggtattattt aaataagcta ggaaattttg gcgatgattc ttacaatcca 54 0 

35 ctaccaattg gtgaagttgc taagggttgg actcaagatt ttcatgttac tattgacatt 60 0 

gatagaagca aacctgctcc atggtactac ctagatgctt caggtaagat gcttacagat 660 

tggcaaaaag taaacggaaa atggtattat tttggctcct ctggttctat ggcaacaggt 720 

tggaaatatg tacgaggcaa atggtattac ttagataata aaaatggtga tatgaaaaca 78 0 

ggatggcaat accttggtaa caagtggtac tacctccgtt catcaggagc tatggtaact 84 0 



ggctggtatc aagatggttt aacttggtac tacctaaatg caggtaatgg agacatgaag
acaggttggt tccaggtcaa tggcaaatgg tactatgctt atagctcagg tgccttggca 
gtgaatacga ccgtagatgg ctattctgtc aactataatg gcgaatgggt tcaataa 

<212> Type : DNA 
<211> Length : 1017

SequenceName : SEQ ID 611 
SequenceDescription : 

Sequence 

<213> OrganismName : Streptococcus pneumoniae R6 

<400> PreSequenceString : 

atgaataaga aaaaaatgat tttaacaagt ctagccagcg tcgctatctt aggggctggt 60

tttgttgcgt ctcagcctac tgttgtaaga gcagaagaat ctcccgtagc cagtcagtct 120

aaagctgaga aagactatga tgcagcgaag aaagatgcta agaatgcgaa aaaagcagta 180

gaagatgctc aaaaggcttt agatgatgca aaagctgctc agaaaaaata tgacgaggat 240

cagaagaaaa ctgaggagaa agccgcgcta gaaaaagcag cgtctgaaga gatggataag 300

gcagtggcag cagttcaaca agcgtatcta gcctatcaac aagctacaga caaagccgca 360

aaagacgcag cagataagat gatagatgaa gctaagaaac gcgaagaaga ggcaaaaact 420

aaatttaata ctgttcgagc aatggtagtt cctgagccag agcagttggc tgagactaag 480

aaaaaatcag aagaagctaa acaaaaagca ccagaactta ctaaaaaact agaagaagct 540

aaagcaaaat tagaagaggc tgagaaaaaa gctactgaag ccaaacaaaa agtggatgct 600

gaagaagtcg ctcctcaagc taaaatcgct gaattggaaa atcaagttca tagactagaa 660

caagagctca aagagattga tgagtctgaa tcagaagatt atgctaaaga aggtttccgt 720

gctcctcttc aatctaaatt ggatgccaaa aaagctaaac tatcaaaact tgaagagtta 780

agtgataaga ttgatgagtt agacgctgaa attgcaaaac ttgaagatca acttaaagct 840

gctgaagaaa acaataatgt agaagactac tttaaagaag gtttagagaa aactattgct 900

gctaaaaaag ctgaattaga aaaaactgaa gctgacctta agaaagcagt taatgagcca 960

gaaaaaccag ctccagctcc agaaactcca gccccagaag caccagctga acaaccaaaa 1020

ccagcgccgg ctcctcaacc agctcccgca ccaaaaccag agaagccagc tgaacaacca 1080 

aaaccagaaa aaacagatga tcaacaagct gaagaagact atgctcgtag atcagaagaa 1140 

gaatataatc gcttgactca acagcaaccg ccaaaagctg aaaaaccagc tcctgcacca 1200 

aaaacaggct ggaaacaaga aaacggtatg tggtacttct acaatactga tggttcaatg 1260 

gcgacaggat ggctccaaaa caacggttca tggtactacc tcaacagcaa tggtgctatg 13 20 

gctacaggtt ggctccaata caatggttca tggtattacc tcaacgctaa cggtgctatg 13 8 0 

gcaacaggtt gggctaaagt caacggttca tggtactacc tcaacgctaa tggtgctatg 1440 

gctacaggtt ggctccaata caacggttca tggtattacc tcaacgctaa cggcgctatg 1500 

gcaacaggtt gggctaaagt caacggttca tggtactacc tcaacgctaa tggtgctatg 1560 

gctacaggtt ggctccaata caacggttca tggtactacc tcaacgctaa cggtgctatg 162 0 

gctacaggtt gggctaaagt caacggttca tggtactacc tcaacgctaa tggtgctatg 1680 

gcaacaggtt gggtgaaaga tggagatacc tggtactatc ttgaagcatc aggtgctatg 1740 

aaagcaagcc aatggttcaa agtatcagat aaatggtact atgtcaatgg tttaggtgcc 1800 

cttgcagtca acacaactgt agatggctat aaagtcaatg ccaatggtga atgggtttaa 18 60 



<212> Type : DNA 
<211> Length : 1860 
SequenceName : SEQ ID 612 

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

atgaaaattt taccgtttat agcaagagga acaagttatt acttgaagat gtcagttaaa 60 

aagcttgttc cttttttagt agtaggattg atgctagcag ctggtgatag tgtctatgcc 12 0 

tattccagag gaaatggatc gattgcgcgt ggggatgatt atcctgctta ttataaaaat 180 

gggagccagg agattgatca gtggcgcatg tattctcgtc agtgtacttc ttttgtagcc 240 

tttcgtttga gtaatgtcaa tggttttgaa attccggcag cttatggaaa tgcgaatgaa 3 00 

tggggacatc gtgctcgtcg ggaaggttat cgtgtagata atacaccgac gattggttcc 3 60 

attacttggt ctactgcagg aacttatggt catgttgcct gggtgtcaaa tgtaatggga 420 

gatcagattg agattgagga atataactat ggttatacag aatcctataa taaacgagtt 480 

ataaaagcaa acacgatgac aggatttatt cattttaaag atttggatag tggcagtgtt 540 

gggaatagtc aatcctcagc ttcaacaggc ggaactcatt attttaagac caagtctgct 60 0 

attaaaactg aacccctagt tagtgcaact gtgattgatt actattatcc tggggagaag 66 0 

gttcattatg atcagatact tgaaaaagac ggctataagt ggttgagtta tactgcctat 720 

aatggaagct atcgttatgt tcaattggag gctgtgaata aaaatcctct aggtaattct 78 0 

gttctttctt caacaggagg aactcattat tttaagatca agtctgctat taaaactgaa 840 

cccctagtta gtgcaactgt gattgattac tattatcctg gagagaaggt tcattatgat 900 

cagatacttg aaaaagacgg ctataagtgg ttgagttata cggcttataa cggaagtcgt 960 

cgctatatac agctagaggg agtgacttct tcacaaaatt atcagaatca atcaggaaat 102 0 

atctctagct atggatccaa taatagttca actgtcggtt ggaagaaaat aaatggtagt 1080 

tggtatcatt tcaaatcaaa tggttctaaa tcaacaggat ggctgaaaga cggttctagc 1140 

tggtattatt tgaaattatc tggtgaaatg cagacaggat ggttaaagga gaatggctcg 1200 

tggtattatc tgggtagttc aggggcaatg aaaacaggct ggtaccaggt ctctggtgag 1260 

tggtattatt cttactcttc aggcgcctta gctattaata cgacggtgga tggctacaga 132 0 

gtaaacagtg atggagaacg agtatag 1347 

<212> Type : DNA 

<211> Length : 1347 

SequenceName : SEQ ID 613 
SequenceDescription : 



Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

atgtttgcat caaaaagcga aagaaaagta cattattcaa ttcgtaaatt tagtattgga 60 

gtagctagtg tagctgttgc cagtcttgtt atgggaagtg tggttcatgc gacagagaac 120 

gagggaagta cccaagcagc cacttcttct aatatggcaa agacagaaca taggaaagct 18 0 

gctaaacaag tcgtcgatga atatatagaa aaaatgttga gggagattca actagataga 240 

agaaaacata cccaaaatgt cgccttaaac ataaagttga gcgcaattaa aacgaagtat 300 

ttgcgtgaat taaatgtttt agaagagaag tcgaaagatg agttgccgtc agaaataaaa 360 

gcaaagttag acgcagcttt tgagaagttt aaaaaagata cattgaaacc aggagaaaag 420 

gtagcagaag ctaagaagaa ggttgaagaa gctaagaaaa aagccgagga tcaaaaagaa 480 

gaagatcgtc gtaactaccc aaccaatact tacaaaacgc ttgaacttga aattgctgag 540 
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ttcgatgtga aagttaaaga agcggagctt gaactagtaa aagaggaagc taaagaatct 600 

cgaaacgagg gcacaattaa gcaagcaaaa gagaaagttg agagtaaaaa agctgaggct 660 

acaaggttag aaaacatcaa gacagatcgt aaaaaagcag aagaagaagc taaacgaaaa 72 0 

gcagatgcta agttgaagga agctaatgta gcgacttcag atcaaggtaa accaaagggg 78 0 

cgggcaaaac gaggagttcc tggagagcta gcaacacctg ataaaaaaga aaatgatgcg 840 

aagtcttcag attctagcgt aggtgaagaa actcttccaa gctcatccct gaaatcagga 90 0 

aaaaaggtag cagaagctga gaagaaggtt gaagaagctg agaaaaaagc caaggatcaa 960 

aaagaagaag atcgccgtaa ctacccaacc aatacttaca aaacgcttga ccttgaaatt 102 0 

gctgagtccg atgtgaaagt taaagaagcg gagcttgaac tagtaaaaga ggaagctaag 10 80 

gaacctcgag acgaggaaaa aattaagcaa gcaaaagcga aagttgagag taaaaaagct 1140 

gaggctacaa ggttagaaaa catcaagaca gatcgtaaaa aagcagaaga agaagctaaa 1200 

cgaaaagcag cagaagaaga taaagttaaa gaaaaaccag ctgaacaacc acaaccagcg 12 60 

ccggctactc aaccagaaaa accagctcca aaaccagaga agccagctga acaaccaaaa 13 2 0 

gcagaaaaaa cagatgatca acaagctgaa gaagactatg ctcgtagatc agaagaagaa 1380 

tataatcgct tgactcaaca gcaaccgcca aaaactgaaa aaccagcaca accatctact 1440 

ccaaaaacag gctggaaaca agaaaacggt atgtggtact tctacaatac tgatggttca 15 00 

atggcaacag gatggctcca aaacaacggt tcatggtact atctaaacgc taatggtgct 1560 

atggcgacag gatggctcca aaacaatggt tcatggtact atctaaacgc taatggttca 1620 

atggcaacag gatggctcca aaacaatggt tcatggtact acctaaacgc taatggtgct 1680 

atggcgacag gatggctcca atacaatggt tcatggtact acctaaacag caatggcgct 1740 

atggcgacag gatggctcca atacaatggc tcatggtact acctcaacgc taatggtgat 18 0 0 

atggcgacag gatggctcca aaacaacggt tcatggtact acctcaacgc taatggtgat 18 60 

atggcgacag gatggctcca atacaacggt tcatggtatt acctcaacgc taatggtgat 192 0 

atggcgacag gttgggtgaa agatggagat acctggtact atcttgaagc atcaggtgct 19 80 

atgaaagcaa gccaatggtt caaagtatca gataaatggt actatgtcaa tggctcaggt 2040 

gcccttgcag tcaacacaac tgtagatggc tatggagtca atgccaatgg tgaatgggta 2100 

aactaa 2106 
<212> Type : DNA 
<211> Length : 2106 

SequenceName : SEQ ID 614 

SequenceDescription : 



Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

atgaaaaaaa ctacaatatt atcattaact acagctgcgg ttattttagc agcatatgtc 60 

cctaatgaac caatcctagc agcatatgtc cctaatgaac caatcctagc agatactcct 120 

agttcggaag taatcaaaga gactaaagtt ggaagtatta ttcaacaaaa taatatcaaa 180 

tataaggttc taactgtaga aggtaacata ggaactgttc aagtgggtaa tggagttact 240 

cctgtagagt ttgaagctgg tcaagatgga aaaccattca cgattcctac aaaaatcaca 3 00 

gtaggtgata aagtatttac cgttactgaa gtagctagtc aagcttttag ttattatcca 3 60 

gatgaaacag gtagaattgt ctactatcct agctctatta ctatcccatc aagcataaaa 420 

aaaatacaaa aaaaaggctt ccatggaagt aaagctaaaa ctattatttt tgacaaaggc 4 80 

agtcagctgg agaaaattga agatagagct tttgattttt ctgaattaga agagattgaa 540 

ttgcctgcat ctctagaata tattggaaca agtgcatttt cttttagtca aaaattgaaa 600 

aagctaacct tttcctcaag ttcaaaatta gaattaatat cacatgaggc ttttgctaat 660 

ttatcaaatt tagagaaact aacattacca aaatcggtta aaacattagg aagtaatcta 72 0 

tttagactca ctactagctt aaaacatgtt gatgttgaag aaggaaatga atcgtttgcc 780 

tcagttgatg gtgttttgtt ttcaaaagat aaaactcaat taatttatta tccaagtcaa 840 

aaaaatgacg aaagttataa aacgcctaag gagacaaaag aacttgcatc atattcgttt 900 

aataaaaatt cttacttgaa aaaactcgaa ttgaatgaag gtttagaaaa aatcggtact 960 

tttgcatttg cggatgcgat taaacttgaa gaaattagct taccaaatag tttagaaact 1020 

attgaacgtt tagcctttta cggtaattta gaattaaaag aacttatatt accagataat 1080 

gttaaaaatt ttggtaaaca cgttatgaac ggtttaccaa aatttttaac attatctggt 1140 

aataatatca actcattgcc gtccttcttc ctaagtggcg tcttagattc attaaaggaa 1200 

attcatatta agaataaaag tacagagttt tctgtgaaaa aagatacatt tgcaattcct 12 60 

gaaactgtta agttctatgt aacatcagaa catataaaag atgttcttaa atcaaattta 1320 

tctactagta atgatatcat tgttgaaaaa gtagataata taaaacaaga aactgatgta 13 8 0 

gctaaaccta aaaagaattc taatcaggga gtagttggtt gggttaaaga caaaggttta 1440 

tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggttta 1500 

tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggttta 1560 

tggtattact taaatgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 1620 

tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 1680 

tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 1740 

tggtattact taaatgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 1800 

tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 18 60 
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tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 1920 
tggtattact taaatgaatc aggttcaatg gctactggtt gggttaaagt ttctggtaaa 1980 
tggtactata cctataattc aggagattta ttagtaaaca cgactacacc cgatggctat 2040 
cgagtcaatg ctaacggtga gtgggtagga tag 2073 
<212> Type : DNA 

<211> Length : 2073 

SequenceName : SEQ ID 615 

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

atggaaatta atgtgagtaa attaagaaca gatttgcctc aagtcggcgt gcaaccatat 60 

aggcaagtac acgcacactc aactgggaat ccgcattcaa ccgtacagaa tgaagcggat 120 

tatcactggc ggaaagaccc agaattaggt tttttctcgc acattgttgg gaacggttgc 180 

atcatgcagg taggacctgt tgataatggt gcctgggacg ttgggggcgg ttggaatgct 240 

gagacctatg cagcggttga actgattgaa agccattcaa ccaaagaaga gttcatgacg 3 00 

gactaccgcc tttatatcga actcttacgc aatctagcag atgaagcagg tttgccgaaa 360 

acgcttgata cagggagttt agctggaatt aaaacgcacg agtattgcac gaataaccaa 420 

ccaaacaacc actcagacca cgttgaccct tatccatatc ttgctaaatg gggcattagc 480 

cgtgagcagt ttaagcatga tattgagaac ggcttgacga ttgaaacagg ctggcagaag 540 

aatgacactg gctactggta cgtacattca gacggctctt atccaaaaga caagtttgag 600 

aaaatcaatg gcacttggta ctactttgac agttcaggct atatgcttgc agaccgctgg 660 

aggaagcaca cagacggcaa ctggtactgg ttcgacaact caggcgaaat ggctacaggc 720 

tggaagaaaa tcgctgataa gtggtactat ttcaacgaag aaggtgccat gaagacaggc 780 

tgggtcaagt acaaggacac ttggtactac ttagacgcta aagaaggcgc catggtatca 840 

aatgccttta tccagtcagc ggacggaaca ggctggtact acctcaaacc agacggaaca 900 

ctggcagaca ggccagaatt cacagtagag ccagatggct tgattacagt aaaataa 957 




<212> Type : DNA 

<211> Length : 957 

SequenceName : SEQ ID 616 
SequenceDescription : 


Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgactttcg cctattggtg tattctgatt gcctacctat tgccgctttt ttgtgcggcg 60 
tatgccaaaa aagcgggcgg attccggttt aaagacaacc acaatccgcg cgattttctg 120 
gcgcgcacgc aaggcacagc cgcccgtgcc cacgccgcgc agcaaaacgg ttttgaagcc 180 
tttgcaccgt ttgcagccgc cgttttgacg gcacacgcaa ccggcaatgc cggacaagca 240 
accgtcaaca cgcttgccgg cctgttcatc ctgttccgcc tcgcctttat ctggtgctac 3 00 

atcgcagaca aagcagcatt acgctcgctg atgtgggtgg gcggatttgt ctgcaccgtc 360 
gggctgtttg tcgtggctgc ttga 3 84 

<212> Type : DNA 
<211> Length : 384 

SequenceName : SEQ ID 617 

SequenceDescription : 

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 



<400> PreSequenceString : 

atgaacaaaa tataccgcat catttggaat agtgccctca atgcctgggt cgccgtatcc 6 0 

gagctcacac gcaaccacac caaacgcgcc tccgcaaccg tgaagaccgc cgtattggcg 12 0 

acactgttgt ttgcaacggt tcaggcgaat gctaccgatg aagatgaaga agaagagtta 180 

gaatccgtac aacgctctgt cgtagggagc attcaagcca gtatggaagg cagcggcgaa 240 

ttggaaacga tatcattatc aatgactaac gacagcaagg aatttgtaga cccatacata 300 

gtagttaccc tcaaagccgg cgacaacctg aaaatcaaac aaaacaccaa tgaaaacacc 360 

aatgccagta gcttcaccta ctcgctgaaa aaagacctca caggcctgat caatgttgaa 42 0 

actgaaaaat tatcgtttgg cgcaaacggc aagaaagtca acatcataag cgacaccaaa 480 

ggcttgaatt tcgcgaaaga aacggctggg acgaacggcg acaccacggt tcatctgaac 540 

ggtatcggtt cgactttgac cgatacgctt gcgggttctt ctgcttctca cgttgatgcg 600 

ggtaaccaaa gtacacatta cactcgtgca gcaagtatta aggatgtgtt gaatgcgggt 660 

tggaatatta agggtgttaa aactggctca acaactggtc aatcagaaaa tgtcgatttc 72 0 
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gtccgcactt acgacacagt cgagttcttg agcgcagata cgaaaacaac gactgttaat 780 

gtggaaagca aagacaacgg caagagaacc gaagttaaaa tcggtgcgaa gacttctgtt 84 0 

attaaagaaa aagacggtaa gttggttact ggtaaaggca aaggcgagaa tggttcttct 900 

acagacgaag gcgaaggctt agtgactgca aaagaagtga ttgatgcagt aaacaaggct 960 

ggttggagaa tgaaaacaac aaccgctaat ggtcaaacag gtcaagctga caagtttgaa 1020 

accgttacat caggcacaaa tgtaaccttt gctagtggta aaggtacaac tgcgactgta 108 0 

agtaaagatg atcaaggcaa catcactgtt atgtatgatg taaatgtcgg cgatgcccta 114 0 

aacgtcaatc agctgcaaaa cagcggttgg aatttggatt ccaaagcggt tgcaggttct 12 0 0 

tcgggcaaag tcatcagcgg caatgtttcg ccgagcaagg gaaagatgga tgaaaccgtc 1260 

aacattaatg ccggcaacaa catcgagatt agccgcaacg gtaaaaatat cgacatcgcc 1320 

acttcgatgg cgccgcagtt ttccagcgtt tcgctcggcg cgggggcaga tgcgcccact 1380 

ttaagcgtgg atgacgaggg cgcgttgaat gtcggcagca aggatgccaa caaacccgtc 1440 

cgcattacca atgtcgcccc gggcgttaaa gagggggatg ttacaaacgt cgcacaactt 150 0 

aaaggcgtgg cgcaaaactt gaacaaccgc atcgacaatg tggacggcaa cgcgcgtgcg 1560 

ggcatcgccc aagcgattgc aaccgcaggt ctggttcagg cgtatctgcc cggcaagagt 1620 

atgatggcga tcggcggcgg cacttatcgc ggcgaagccg gttacgccat cggctactcc 168 0 

agtatttccg acggcggaaa ttggattatc aaaggcacgg cttccggcaa ttcgcgcggc 1740 

catttcggtg cttccgcatc tgtcggttat cagtggtaa 1779 
<212> Type : DNA 

<211> Length : 1779 

SequenceName : SEQ ID 618 
SequenceDescription : 



Sequence 

<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 



<400> PreSequenceString : 

attcttttgg ctgaaggtca aaaatcagcc gtcaccgagt attacctgaa tcacggcaca 60 

tggcccagca acaacagtga tgccggcgtg gcatccaccg ccaccgacat caaaggaaaa 120 

tatgttaaag aagttaaagt cgaaaaaggc gtcattaccg ccacaatgct ttcaagcggc 180 

gtaaacaacg aaatcaaagg caaaaaactc tccctgtggg ccaagcgtca agccggttcg 240 

gtaaaatggt tctgcggaca gccggttgag cgcgccgcca acaacgccgc caacgacgcc 3 00 

gtcaccgccg ccaccgccaa cggcaacggc aagatcgaca ccaaacacct gccgtcaacc 3 60 

tgccgcgacg cagcatctgc cgtttgcata gaaacaccac ctacggcttt ctataaaaat 42 0 

acctaa 426 



<212> Type : DNA 

<211> Length : 426 

SequenceName : SEQ ID 619 
SequenceDescription : 


Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 



<400> PreSequenceString : 

atgaaaacaa ccgacaaacg gacaaccgaa acacaccgca aagccccgaa aaccggccgc 60 

atccgcttct cgcctgctta cttagccata tgcctgtcgt tcggcattct tccccaagct 120 

tgggcgggac acacttattt cggcatcaac taccaatact atcgcgactt tgccgaaaat 180 

aaaggcaagt ttgcagtcgg ggcgaaagat attgaggttt acaacaaaaa aggggagttg 240 

gtcggcaaat caatgacaaa agccccgatg attgattttt ctgtggtgtc gcgtaacggc 3 00 

gtggcggcat tggtgggcga tcaatatatt gtgagcgtgg cacataacgg cggctataac 360 

aacgttgatt ttggtgcgga aggaagaaat cccgatcagc accgtttttc ttaccaaatt 420 

gtgaaaagaa ataattataa gcctgacaat tcacaccctt acaacggcga ttaccatatg 4 80 

ccgcgtttgc ataaatttgt cacagatgca gaacctgtcg aaatgacgag tgacatgagg 540 

gggaatacct attccgataa agaaaaatat cccgagcgtg tccgcatcgg ctcaggacac 600 

cactattggc gttatgatga tgacaaacac ggcgatttat cctactccgg cgcatggtta 660 

attggcggca atacacatat gcagggttgg ggaaataatg gcgtagttag tttgagcggc 720 

gatgtgcgcc atgccaacga ctatggccct atgccgattg caggtgcggc aggcgacagc 780 

ggttcgccaa tgtttattta tgacaaaaca aacaataaat ggctgctcaa cggagtttta 840 

caaaccggct acccttattc cggcagggaa aacggtttcc agctgatacg caaagattgg 900 

ttctacgatg acatttacag aggcgataca cataccgtct tttttgaacc gcgcagtaac 960 

ggacattttt cctttacatc caacaacaac ggtacgggta cggtaacaga aaccaacgaa 1020 

aaggtttcca atccaaagct taaagtacag acagtccgac tgtttgacga atctttgaat 1080 

gaaactgata aagaaccagt ttacgcggca gggggtgtta atcagtaccg tccaaggtta 1140 

aacaacggtg aaaacctttc ttttatcgat tacggcaacg gcaaactcat cttatcaaac 1200 

aacatcaacc aaggcgcggg cggtttgtat tttgaaggtg attttacggt ctcgcctgaa 1260 

aacaacgaaa cgtggcaagg cgcgggcgtt catatcagtg aagacagtac cgttacttgg 1320 

aaagtaaacg gcgtggcaaa cgaccgcctg tccaaaatcg gcaaaggcac gctgcacgtt 1380 
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55 






caagccaaag gggaaaacca aggctcgatc agcgtgggcg acggtacagt cattttggat 1440 

cagcaggcag acgataaagg caaaaaacaa gcctttagtg aaatcggctt ggtcagcggc 150 0 

aggggtacgg tgcaactgaa tgccgataat cagttcaacc ccgacaaact ctatttcggc 1560 

tttcgcggcg gacgtttgga tttaaacggg cattcgcttt cgttccaccg tattcaaaat 162 0 

accgatgaag gggcgatgat tgtcaatcat aatgccacaa caacatccac cgttaccatt 1680 

acagggaatg aaagtattac acaaccgagt ggtaagaata tcaatagact taattacagc 1740 

aaagaaattg cctacaacgg ttggtttggc gagaaagata cgaccaaaac gaacgggcgg 180 0 

ctcaaccttg tttaccagcc cgccgcagaa gaccgcaccc tgctgctttc cggcggaaca 18 60 

aatttaaacg gcaacatcac gcaaacaaac ggcaaactgt ttttcagcgg cagaccgaca 1920 

ccgcacgcct acaatcattt aggaagcggg tggtcaaaaa tggaaggtat cccacaagga 1980 

gaaatcgtgt gggacaacga ctggatcaac cgcacgttta aagcggaaaa tttccatatt 2040 

cagggcgggc aggcggtgat ttcccgcaat gttgccaaag tggaaggcga ttggcatttg 210 0 

agcaatcacg cccaagcagt ttttggtgtc gcaccgcatc aaagccatac aatctgtaca 2160 

cgttcggact ggacgggtct gacaaattgt gtcgaaaaaa ccattaccga cgataaagtg 2220 

attgcttcat tgactaagac cgacatcagc ggcaatgtca gccttgccga tcacgctcat 2280 

ttaaatctca cagggcttgc cacactcaac ggcaatctta gtgcaaatgg cgatacacgt 234 0 

tatacagtca gccacaacgc cacccaaaac ggcaacctta gcctcgtggg caatgcccaa 2400 

gcaacattta atcaagccac attaaacggc aacacatcgg cttcgggcaa tgcttcattt 2460 

aatctaagca acaacgccgc acaaaacggc agtctgacgc tttccgacaa cgctaaggca 252 0 

aacgtaagcc attccgcact caacggcaat gtctccctag ccgataaggc agtattccat 2580 

tttgaaaaca gccgctttac cggacaactc agcggcagca aggatacagc attacactta 2 64 0 

aaagacagcg aatggacgct gccgtcaggc acggaattag gcaatttaaa ccttgacaac 27 0 0 

gccaccatta cactcaattc cgcctatcgc cacgatgctg caggcgcgca aaccggcagt 2760 

gtgtcagaca cgccgcgccg ccgttcgcgc cgttccctat tatccgttac accgccaact 282 0 

tcggtagaat cccgtttcaa cacgctgacg gtaaacggca aattgaacgg tcaaggaaca 2880 

ttccgcttta tgtcggaact cttcggctac cgaagcgaca aattgaagct ggcggaaagt 2940 

tccgaaggca cttacacctt ggcggtcaac aataccggca acgaacccgt aagcctcgat 30 00 

caattgacgg tagtggaagg gaaagacaac aaaccgctgt ccgaaaacct taatttcacc 3 0 60 

ctgcaaaacg aacacgtcga tgccggcgcg tggcgttacc aactcatccg caaagacggc 312 0 

gagttccgcc tgcataatcc ggtcaaagaa caagagcttt ccgacaaact cggcaaggca 3180 

gaagccaaaa aacaggcgga aaaagacaac gcgcaaagcc ttgacgcgct gattgcggcc 3240 

gggcgcgatg ccgccgaaaa gacagaaagc gttgccgaac cggcccggca ggcaggcggg 33 0 0 

gaaaatgtcg gcattatgca ggcggaggaa gagaaaaaac gggtgcaggc ggataaagac 3360 

agcgccttgg cgaaacagcg cgaagcggaa acccggccgg ctaccaccgc cttcccccgc 342 0 

gcccgccgcg cccgccggga tttgccgcaa ccgcagcccc aaccgcaacc tcaaccccaa 3480 

ccgcagcgcg acctgatcag ccgttatgcc aatagcggtt tgagtgaatt ttccgccacg 3 54 0 

ctcaacagcg ttttcgccgt acaggacgaa ttggaccgcg tgtttgccga agaccgccgc 360 0 

aacgccgttt ggacaagcgg catccgggac accaaacact accgttcgca agatttccgc 3 660 

gcctaccgcc aacaaaccga cctgcgccaa atcggtatgc agaaaaacct cggcagcggg 3 72 0 

cgcgtcggca tcctgttttc gcacaaccgg accgaaaaca ccttcgacga cggcatcggc 3780 

aactcggcac ggcttgccca cggcgccgtt ttcgggcaat acggcatcgg caggttcgac 3 840 

atcggcatca gcacgggcgc gggttttagc agcggcagtc tttcagacgg catcggaggc 390 0 

aaaatccgcc gccgcgtgct gcattacggc attcaggcac gataccgcgc cggtttcggc 3 960 

ggattcggca tcgaaccgta catcggcgca acgcgctatt tcgtccaaaa agcggattac 402 0 

cgctacgaaa acgtcaatat cgccaccccc ggtcttgcgt tcaaccgcta ccgcgcgggc 4080 

attaaggcag attattcatt caaaccggcg caacacattt ccatcacgcc ttatttgagc 4140 

ctgtcctata ccgatgccgc ttcgggcaaa gtccgaacac gcgtcaatac cgccgtattg 4200 

gctcaggatt tcggcaaaac ccgcagtgcg gaatggggcg taaacgccga aatcaaaggt 4260 

ttcacgctgt ccctccacgc tgccgccgcc aaaggcccgc aactggaagc gcaacacagc 432 0 

gcgggcatca aattaggcta ccgctggtaa 4350 
<212> Type : DNA 
<211> Length : 4350 

SequenceName : SEQ ID 620 
SequenceDescription : 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaacaccc ttcaaaaagg ttttaccctt atcgagctga tgattgtgat tgccatcgtc 60 

ggcattttgg cggcagtcgc ccttcctgct tatcaagact acacagcccg cgcacaagtt 120 

tccgaagcca ttcttttagc cgaaggtcaa aaatcagccg tcaccgagta ttacctgaat 180 

cacggcgaat ggcccagcaa caacacttct gccggcgtgg catcctccac cgacattaaa 240 

ggcaaatatg ttcaaagcgt tgaagtcaaa aacggcgtcg ttaccgccac aatggcttca 3 00 

agcaacgtaa acaacgaaat caaaggcaaa aaactctccc tgtgggccaa gcgtcaagac 360 

ggttcggtaa aatggttctg cggacagccg gttaagcgca acgacaccgc caccaccaac 420 

gacgacgtca aagccgacac cgccgccaac ggcaagcaga tcgacaccaa gcacctgccg 480 
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tcaacctgcc gcgacgcagc atctgccgga taa 513 
<212> Type : DNA 
<211> Length : 513 

SequenceName : SEQ ID 621 
SequenceDescription : 

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 

<400> PreSequenceString : 

atgcaagcac ggctgctgat acctattctt ttttcagttt ttattttatc cgcctgcggg 60 
acactgacag gtattccatc gcatggcgga ggtaaacgct ttgcggtcga acaagaactt 120 
gtggccgctt ctgccagagc tgccgttaaa gacatggatt tacaggcatt acacggacga 180 
aaagttgcat tgtacattgc aactatgggc gaccaaggtt caggcagttt gacagggggt 240 
cgctactcca ttgatgcact gattcgtggc gaatacataa acagccctgc cgtccgtacc 300 
gattacacct atccacgtta cgaaaccacc gctgaaacaa catcaggcgg tttgacaggt 360 
ttaaccactt ctttatctac acttaatgcc cctgcactct cgcgcaccca atcagacggt 420 
agcggaagta aaagcagtct gggcttaaat attggcggga tgggggatta tcgaaatgaa 480 
accttgacga ctaacccgcg cgacactgcc tttctttccc acttggtaca gaccgtattt 540 
ttcctgcgcg gcatagacgt tgtttctcct gccaatgccg atacggatgt gtttattaac 600 
atcgacgtat tcggaacgat acgcaacaga accgaaatgc acctatacaa tgccgaaaca 660 
ctgaaagccc aaacaaaact ggaatatttc gcagtagaca gaaccaataa aaaattgctc 720 
atcaaaccaa aaaccaatgc gtttgaagct gcctataaag aaaattacgc attgtggatg 780 
ggaccgtata aagtaagcaa aggaattaaa ccgacagaag gattaatggt cgatttctcc 840 
gatatccaac catacggcaa tcatatgggt aactctgccc catccgtaga ggctgataac 900 
agtcatgagg ggtatggata cagcgatgaa gcagtgcgac gacatagaca agggcaacct 960 
tga 963 

<212> Type : DNA 
<211> Length : 963 
SequenceName : SEQ ID 622 

SequenceDescription : 



Sequence 

<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgcgcccaa tcttcctatc tttcgtttta ttccctattt tgataaccgc ctgcagcaca 60 
ccggacaagt ctgcccgatg ggaaaatatc ggcacaatct caaacggcaa tattcataca 120 
tatatcaata aagacagcgt gagaaaaaac ggaaatctga tgattttcca agataaaaaa 18 0 

gttgttacca atctaaaaca agaacgtttt gccaacaccc ccgcatacaa gactgccatt 240 

gccgagtggg aaatccactg caacaacaaa acataccgct taagttcgct acaattgttt 300 
gatacaaaaa acacggaaat ttccacacaa aactacacag cctcttccct ccgcccgatg 360 
agcatcctgt ccgggacatt aaccgaaaaa caatatgaaa ccgtatgcgg aaaaaaactc 42 0 

tga 423 

<212> Type : DNA 

<211> Length : 423 

SequenceName : SEQ ID 623 
SequenceDescription : 

Sequence 

<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaacaaac ttttcattac cgccctgtcc gccctcgcct tgtccgcctg cgccggcact 60 
tgggagggcg cgaaacaaga caccgcccgc aaccttgaca aaacacaggc cgccgccgaa 120 
cgcgccgccg aacaaacagg caacgccgtc gaaaaaggct gggacaaaac caaagaagcc 180 
gtcaaaaaag gcggcaatgc cgtcggacgc ggcatttccc atctcggcgg aaaaatcgaa 240 
aacgccaccg aataa 255 
<212> Type : DNA 
<211> Length : 255 

SequenceName : SEQ ID 624 
SequenceDescription : 
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atgaaactcc tcttcatccc cctagtcctc ttcgtcgccg tcgaacattt ctacatcgcc SO 

tggcttgaaa tgacgcagat tcccagcgaa aaagcggcgg aaacgttcaa gctgccttat 120 

gaatttatgg aacaaaatcg cgtgcagacc ctgttcggca accaagggct gtataacggc 180 

tttctcggca tcgggctggt gtggtcgcgg tttgccgctc cggataacgc ggtgtacggc 240 

gcaacggtac tgtttctcgg cttcgtcctg attgccgccg cgtggggcgc gttctcttcc 300 

ggcaacaaag gcatactcgt caaacaaggt ttgcccgcat ttttggcagc ggcggcggtg 360 

ttggcggtat ga 372 

<212> Type : DNA 
<211> Length : 372 
SequenceName : SEQ ID 625 

SequenceDescription : 

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atggcttcaa gcaacgtaaa caacgaaatc aaagacaaaa aactctccct gtgggccaag 60 
cgtcaagacg gttcggtaaa atggttctgc ggacagccgg ttaagcgcga cgccgccacc 120 
gacgccgacg tcaccgccga cagcggcaac gaaatcgaca ccaagcacct gccgtcaacc 180 
tgccgcgacg cagcatctgc cgtttgcaca aaaacacccg agtattaccc gaatcacggc 240 
gaatggccga aaaacttcgt cattcccgcg caggcgggaa tccaggtctg tcggcacgga 300 

aacttatcgg gtaaaaaggt ttctccggtc ctgagttcta gattcccact ttcgtgggaa 360 
tga 363 

<212> Type : DNA 
<211> Length : 363 

SequenceName : SEQ ID 626 
SequenceDescription : 



Sequence 

<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atccttttag ccgaaggtca aaaatcagcc gtcaccgagt attacctgaa tcacggcgaa 60 

tggcccagca acaacacttc tgccggcgtg gcaacctcca ccgacattaa aggcaaatat 120 

gttcaaagcg ttgaagtcaa aaacggcgtc gttaccgcca caatggcttc aagcaacgta 180 
aacaacgaaa tcaaaggcaa aaaactctcc ctgtgggcca agcgtcaaga cggttcggta 240 
aaatggttct gcggacagcc ggttaagcgc aacgacaccg ccaccaccaa cgacgacgtc 300 
aaagccgaca ccgccgccaa cggcaagcag atcgacacca agcacctgcc gtcaacagca 360 
tcgacaagaa aatcgacacc aaactag 387 

<212> Type : DNA 

<211> Length : 387 

SequenceName : SEQ ID 627 
SequenceDescription : 

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgcccatcc cctttaaacc cgtattggct gccgccgcca tcgcccaagc gtttcccgcc 60 

tttgcggcag accccgcgcc gcagtccgcc caaacgctga acgaaatcac cgttaccggc 120 

acgcacaaaa cccaaaaact cggcgaagaa aaaatccgcc gcaaaacttt agacaagctc 180 

ttggtcaacg acgaacacga cctggtgcgc tacgaccccg gcatttccgt cgtcgaaggc 240 

ggcagggcgg gttctaacgg ctttaccata cgcggcgtgg acaaagaccg cgtcgccatc 300 

aacgttgacg ggctggcgca ggcggaaagc cgctcttccg aagccttcca agaattgttc 360 

ggcgcgtacg gcaacttcaa cgccaaccgc aacacttccg agccggaaaa cttttccgaa 420 

gtaaccatca ccaaaggcgc ggactcgctc aaatccggca gcggcgcatt gggcggcgca 480 

gtcaattacc aaaccaaatc cgcaagcgat tatgtttccg aagacaagcc ctaccatttg 540 

gggataaagg gcggcagcgt cggcaaaaac agccaaaaat tcagcagcat caccgccgcc 600 

ggcaggctct ttggtttgga tgccttattg gtttataccc gccgcttcgg caaagaaacc 660 

aaaaaccgct cgaccgaggg cgatatcgaa attaaaaacg acggatatgt ctataacccg 720 

accgatacag gcggacccag caagtacctg acctatgtag ccacaggggt tgcgcgctcc 780 

caacccgacc cgcaagaatg ggtaaacaaa agcaccctgt tcaagctggg ctacaacttc 840 

aacgatcaaa accgtatcgg ctggattttt gaagactcgc gcaccgaccg ttttaccaac 900 

gagctgtcta atttgtggac gggtacgacc acgtctgccg caacgggcga ctaccgccac 960 

cgccaagacg tgagctaccg ccgccgctcc ggtgtcgaat acaaaaacga attggaacac 1020 

ggcccgtggg acagccttaa gctgcgctac gacaagcagc gcatcgatat gaacacttgg 1080 

acttgggaca tcccgaaaaa ttacgataaa agaggcatca acggcgaggt ttaccattcg 1140 
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ttccggcata tccgccaaaa caccgcgcaa tggactgccg attttgaaaa acaactcgac 120 0 

ttttccaaag ccgtttgggc ggcgcaatac ggcttgggcg gcggcaaagg ggacaatgcc 1260 

aactcggatt acagctattt cgcaaaactg tacgacccca aaatcctcgc ttccaaccaa 1320 

gccaaaatca caatgctgat cgaaaaccgg tcgaaatata aatttgccta ttggaacaat 138 0 

gcgtttcact tgggcggcaa cgaccgcttc cgcctgaatg cgggcatacg ctacgacaaa 1440 

aacagcagca gcgcgaaaga cgatccgaaa tacaccaccg ccatccgggg gcagattccc 1500 

catttgggtt cggaacgcgc gcacgcgggc ttcagctacg gcacggggtt cgactggcgg 1560 

tttaccaagc atctgcactt gttggcaaaa tacagcaccg gcttccgcgc accgacttcg 162 0 

gacgaaactt ggctactgtt cccacacccc gatttctacc tgaaagccaa cccaaacctg 168 0 

aaagccgaaa aagccaaaaa ctgggaattg ggtctggcgg gcagcggcaa agcgggcaac 174 0 

ttcaagctct cgggcttcaa aaccaaatac cgcgacttta tcgaattgac gtatatgggc 1800 

gtttcgtcag acgataaaaa caaccccaga tacgccccgc tttcagacgg tacggcattg 1860 

gtcagctcgc ccgtttggca aaaccaaaac cgctccgccg cctgggtgaa aggcatagag 192 0 

tttaacggqa cgtggaacct cgacagtatc ggtttgccca aagggctgca caccggcctc 1980 

aacgtcagct acatcaaggg caaggcaacg caaaacaacg gcaaagaaac gcccatcaac 2 040 

gcgctttcgc cgtggacggc ggtttacagc ctgggctatg acgcgccttc caaacgctgg 210 0 

ggcatcaacg cctacgccac gcgcaccgcc gccaaaaagc cgtccgacac cgtccacagc 2160 

aacgacgact tgaacaaccc gtggccttat gccaaacaca gcaaggccta tacgctgttc 222 0 

gacctttccg cctacctcaa catcggcaaa caggttacgc tccgcgccgc cgcgtacaac 228 0 

attaccaaca agcagtacta cacttgggaa tctttacgca gcatccgcga gttcggcacg 234 0 

gtcaaccgcg ttgacaacaa aacccacgcc ggcatccaac gctttacctc gccgggcagg 240 0 

agctacaatt tcaccatcga agcgaagttc taa 2433 
<212> Type : DNA 
<211> Length : 2433 

SequenceName : SEQ ID 628 

SequenceDescription : 

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaaaaaat ccctgattgc cctgactttg gcagcccttc ctgttgcagc aatggctgac 60 
gttaccctgt acggcaccat caaaaccggc gtagaaactt cccgctctgt agaacacaat 120 
ggaggtcagg tggttagcgt tgaaaccggt accggcatcg ttgatttggg ttcgaaaatc 180 
ggcttcaaag gccaagaaga cctcggtaac ggcctgaaag ccatttggca ggttgagcaa 240 
aaggcatcta tcgccggcac tgactccggt tggggcaacc gccaatcctt catcggcttg 300 
aaaggcggct tcggtaaatt gcgcgtcggc cgtttgaaca gcgtcctgaa agacaccggc 360 
gacatcaatc cttgggatag caaaagcgac tatttgggtg taaacaaaat tgccgaaccc 420 
gaagcacgcc tcatttccgt acgctacgat tctcccgaat ttgccggcct cagcggcagc 480 
gtacaatacg cgcttaacga caatgtaggc agacataaca gcgaatctta ccacgccggc 540 
ttcaactaca aaaacggcgg cttcttcgtg caatatggcg gtgcctataa aagacatcag 600 
gatgtggatg acgtgaagat tgagaaatac cagattcacc gtttggtcag cggttacgac 660 
aatgatgccc tatacgcttc cgtagccgta cagcaacaag acgcgaaact ggttgaagac 720 
aattcgcaca actctcaaac cgaagttgcc gctaccttgg cataccgctt cggcaacgta 780 
acgccccgcg tttcttacgc ccacggcttc aaaggctcgg ttgatgatgc aaaacgcgac 840 
aatacttacg accaagtggt tgtcggtgcg gaatacgact tctccaaacg cacttctgcc 900 
ttggtttctg ccggttggtt gcaagaaggc aaaggcgaaa acaaattcgt agcgactgcc 960 
ggcggtgtcg gtctgcgcca caaattctaa 990 
<212> Type : DNA 
<211> Length : 990 

SequenceName : SEQ ID 629 

SequenceDescription : 

Sequence 

<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaaaaccc tgctcctcct catccccctc gtcctcacag cctgcggcac actgaccggc 60 

atacccgccc acggcggcgg caaacgcttt gccgtcgaac aagaactcgt cgccgcatcg 120 

tcccgcgccg ccgtcaaaga aatggacttg tccgccctga aaggacgcaa agccgccctt 180 

tacgtctccg ttatgggcga ccaaggttcg ggcaacataa gcggcggacg ctactctatc 240 

gacgcactga tacgcggcgg ctaccacaac aaccccgaaa gtgccaccca atacagctac 300 

cccgcctacg acactaccgc caccaccaaa tccgacgcgc tctccagcgt aaccacttcc 360 

acatcgcttt tgaacgcccc cgccgccgcc ctgacgaaaa acagcggacg caaaggcgaa 420 

cgctccgccg gactgtccgt caacggcacg ggcgactacc gcaacgaaac cctgctcgcc 480 

aacccccgcg acgtttcctt cctgaccaac ctcatccaaa ccgtcttcta cctgcgcggc 540 

atcgaagtcg taccgcccga atacgccgac accgacgtat tcgtaaccgt cgacgtattc 600 

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ggcaccgtcc gcagccgcac cgaactgcac ctctacaacg ccgaaaccct taaagcccaa 660 

accaagctcg aatatttcgc cgttgaccgc gacagccgga aactgctgat tgcccctaaa 72 0 

accgccgcct acgaatccca ataccaagaa caatacgccc tctggatggg accttacagc 78 0 

gtcggcaaaa ccgtcaaagc ctcagaccgc ctgatggtcg atttctccga catcaccccc 840 

tacggcgaca caaccgccca aaaccgtccc gacttcaaac aaaacaacgg taaaaaaccc 9 00 

gatgtcggca acgaagtcat ccgccgccgc aaaggaggat aa 942 
<212> Type : DNA 
<211> Length : 942 

SequenceName : SEQ ID 630 

SequenceDescription : 



Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaataaaa ccttgtctat tttgccggtg gcaatcttac tcggcggctg cgccgccggg 60 

ggcggtaaca cattcggcag cttagacggc ggcacaggta tgggcggcag catcgtcaaa 120 

atggcggtag aaagccaatg ccgtgcggaa ttgaacaaac gcagcgaatg gcgtttgacc 180 

gcgctggcga tgagtgccga aaaacaggcg gaatgggaaa acaagatttg cgcttgcgtc 240 

gcccaagaag cacccaacca gctgaccggc aacgatgtga tgcagatgct ggatccgtcc 3 00 

acgcgcaatc aggcacttgc cgccctgacc gccaaaacgg tttccgcctg cttcaaacac 3 60 

ctgtaccgct aa 372 

<212> Type : DNA 
<211> Length : 372 

SequenceName : SEQ ID 631 

SequenceDescription : 



Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaatccac ttattcatca agcaaaggaa tcatctatgc aaacccgcat cctctccgcc 60 

gtactgctgg ctttttcaac cgctgccttt gccgggggcg cattcacgct gcaattcgac 12 0 

aacccgtccg aagacggcgg cttcacgcaa aaccagattt tgagcgcgcc ttacggcttt 18 0 

ggctgttcgg gcggcaatgc ttcgcccgcg ctgtcgtgga aaaatccgcc cgccgggaca 240 

aaaagtttcg tcctgaccgt ttacgataaa gacgcgccga ccggactggg ctggatgcac 300

tgggtggtcg ccgacattcc cgccgatgtc cgccgccgca atgcgacctc gctgcaatta 360 

agccgctgcg ccagcatcgc cgacgaccag tccgcagcca tatcggcagt aatcagtttg 420

cagatttgcc gcatcaggtt gacgccttcg tacacggcaa aaccgatgcc gtcatgctgc 480

aaccacgcca acacgccgca aagcgcggcc tccgccgcat tgtgcggcac ttcttcatcc 540

gtcagcaccg ccgccgcata a 561 
<212> Type : DNA
<211> Length : 561 

SequenceName : SEQ ID 632 

SequenceDescription : 



Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaataaaa ctttaaaaag gcgggttttc cgccataccg cgctttatgc cgccatcttg 60 

atgttttccc ataccggcgg gggggggggg gcgatggcgc aaacccgtca atacgctatt 120 

atcatgaacg agagaaacca gcccgaggta cagtggaatg ggtcatattc aataaaggac 180

aaagacagga agcgcgaata tactcatcat aatcaccaac aaggaggaag ctctgtctca 240 

ttcaacaata gcgatgagct tgtttctcga caaagcggta ctgccgtttt tggcacagcc 300

acctacctgc cgccctacgg caaggtttcc ggttttgatg ccgccgctct gaaagagcgc 360 

aacaatgccg tcgattggat tcataccacc cacccagggt tgataggcta cagctacgac 420

ggtgtcgtat gcagaagcgc cacagactgt cccaaacttg tctataaaac ccgattttcc 480 

ttcgataatc ccgacttggc aaaaacagga ggcgggttgg ataagcacac agagccaagc 540 

cgcgacaatt cgcccattta caaattgaag gatcatccat ggttgggcgt gtctttcaat 600 

ttgggtgccg agggtatcgc caaaaatggt aagacaatca acaaattggt atcttctttt 660 

aatgaaaaga atagtaataa caacctcgtc tataccacgg aaggccgcga tatttccttg 720

ggcaactggc agcgcgaaac gaccgccatg gcctattatc tgaacgccaa gctgcacctg 780 

ctggataaaa aacagattca aaatatcacc gacaaaacag tgcagttggg tgtcttgaag 840 

ccgagcatcg atgtgcggac aagaaatacg gggactgccg gcattctatc ttattgggct 900

aagtgggaca ttaaagatac cgggcagatt ccagtcaagc tcagcttgac gcaagtcaaa 960 

gcaggccgct gcgtcaacaa agataacccc aataagaata ccaaaacctc ttcccccgca 1020 
ctgactgccc ccgcgctgtg gttcggagct gggcaagatg gtaaggcgga gatgtattcc 1080

gcttcggttt ccacctaccc cgacagttcg agcagccgca tcttccttca aaatctgaaa 1140 

agaaaaaccg acaccagcag acccggccgc tattccctcg caaccttgaa taagtcggat 1200

attgaaagtc gagagccgag tttcacaagt cggcaaaccg tcatccgatt ggatggcggc 1260 

gtacagcaga tcaaactgga tagaaacaat actgaggtca ctggttttaa tggaaatgac 1320 

ggcaaaaacg acactttcgg cattgttagt gaagggagct tcatgcctga tgccagcgag 1380

tggaaaaaag tattgctgcc ttggacggtt cgtgctttca attatgacgg tcgatttaac 1440 

acagtcaaca aagaagaaaa caacggcaag ccaaaataca gtcaaaaata ccgcagccgc 1500 

aacaacggca agcacgagcg caatttgggc gacatcgtca acagccccat cgtggcggtc 1560 

ggcgagtatt tggctacttc cgccaacgac gggatggtgc atatcttcaa acaaagcggc 1620 

ggggacaagc gcagctacaa tctgaagctc agctacatcc ccggcacgat gccgcgcaag 1680

gatattgaaa gcaaagactc cacccttgcc aaagagctgc gcgcctttgc cgaaaaaggc 1740 

tatgtgggcg accgctacgg cgtggacggc ggctttgtct tgcgccgcat tacagatgac 1800

cap.gacaagc aaaaacactt ctttatgttc ggcgcaatgg gccttggcgg cagaggcgca 1860

tacgccttgg atttgaccaa agccgacgac aatgacccga caaaagcctc tttgtttgat 1920 

gtaaaagata acggcaataa tggcaataac ggcaataatc gcgtggaatt aggctacacc 1980 

gtcggcacgc cgcaaatcgg caaaacccac aacggcaaat acgccgcctt cctcgcctcc 2040 

ggttatgcga ctaaacagat tgacagcggc gagaataaaa ccgcgctgta tgtgtatgat 2100 

ttggaaagca acaacggtac gctgattaga aaaatcgaag taaccgacgg caagggcggg 2160 

ctttcgtccc ccacgctggt ggataaagat ttggacggca cggtcgatat cgcctatgcc 2220 

ggcgatcgcg gcggcaagat gtaccgcttt gatttaagcg gcaacaaccc gaacagttgg 2280 

actgtacgca ctattttcca aggcacgaag ccgattactt ccgcgcccgc catttcccaa 2340 

ctgaaagaca aacgcgtggt tatcttcggt acgggcagtg atttgagtga ggatgatgta 2400 

ctcagtacgg atgaacaaca tatttacggt atttttgaca atgacacaaa cacgggtacg 2460 

gcgcaagagg ggctgggcaa agggctgctc gagcaaaagc ttagtgagga aaataaaacc 2520 

ttattcctga ccgattataa gcgatccgac ggctcgggcg acaagggctg ggtagtgaaa 2580

ttgaaggacg gacagcgcgt taccgtcaaa ccgaccgtgg tattgcgtac cgcctttgta 2640 

accatccata aatatacggg taatgacaaa tgcggcgcgg aaaccgccat tttgggcatc 2700 

aataccgccg acggcggcaa gctgaccaag aaaagcgcgc gcccgattgt gccggcagcc 2760 

aattcgaagg tcgcgcaata ttccggcgat aagaaaactt ccagcggcaa atccatccct 2820

ataggttgta tggaaaaaga cgggggaacc gtctgcccga acggatatgt ttacgacaaa 2880 

ccggttaatg tgcgctacct ggacgaaaag aaaacagacg gattttcaac aacggcagac 2940 

ggcgatgcgg gcggcagcgg aacattcaaa gagggtaaaa aacccgcccg caataaccgg 3000

tgcttctccg gaaaaggggt gcgcaccctg ctgatgaacg atttggacag cttggatatt 3060 

accggcccga tgtgcggtat gaaacgaatc agctggcgtg aagtcttcta ctga 3114 

<212> Type : DNA 
<211> Length : 3114 

SequenceName : SEQ ID 633 

SequenceDescription :

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaaacacc ccaaactcac ccttatcgcc gcattgctga ccactgccgc aactgccgcc 60 

cccctgccgg ttgtaaccag cttcagcatt ttaggcgacg tagccaaaca aatcggcgga 120

gagcgcgtat ccatacaaag tttggtcgga gccaaccaag atacgcacgc ctatcatatg 180

accagcggcg acattaaaaa aatccgcagt gcaaaactcg tcctgattaa cggcttagga 240 

cttgaagctg ccgacatcca acgtgccgtc aaacagagca aagtatccta tgccgaagcg 300 

accaaaggca tccaacccct caaagccgaa gaagaaggcg gacaccatca cgaccacgat 360

catgaccacg accatgacca cgaaggacac caccacgacc acggcgaata tgacccccac 420

gtctggaacg accccgtcct tatgtccgcc tatgcccaaa acgtcgccga agccctgata 480

aaggccgacc ccgaaggcaa agtttattat caacaacgct tgggcaacta ccaaatgcag 540 

ctcaaaaaac tgcacagtga cgcacaagcc gcatttaatg ccgtccctgc cgccaaacgc 600 

aaagtcctga ccgggcacga tgccttttcc tatatgggca aacgttacca tatcgaattc 660 

atcgccccac aaggtgtgag cagcgaagcc gagccttcag ccaaacaagt cgccgccatc 720

atccgacaaa tcaaacgcga aggcatcaaa gccgtattta ccgaaaatat caaagacacc 780 

cgcatggttg accgcatcgc caaagaaacc ggtgtcaacg tcagcggcaa actgtattcc 840 

gacgcactcg gcaacgcacc cgcagacacc tacatcggca tgtaccgcca caacatcaaa 900

gccttaacca acgcgatgaa gcaataa 927 
<212> Type : DNA 
<211> Length : 927 

SequenceName : SEQ ID 634 

SequenceDescription : 



Sequence 
<213> OrganismName : Streptococcus pyogenes strain MGAS8232
<400> PreSequenceString : 

atgaaaaaaa gaattttatc agcagttctt gtaagtggtg ttaccctcgg agcagctaca 60 

actgtaggag cggaggattt gagtactaag attgctaagc aggattctat tatctcaaat 120

ctgactacag agcaaaaagc tgcacagaat caagtttcag cgttacaggc tcaagtaagt 180 

tcactacaat ctgaacaaga taaactgacc gcaagaaata cagaacttga ggcactttca 240 

aagcgatttg agcaagaaat taaggctcta acaagtcaaa ttgttgctcg taatgaaaaa 300

ttaaaaaatc aagctcgtag tgcttataaa aacaatgaaa cttctggtta cattaatgca 360

cttttgaatt ctaaatcaat ttctgatgtt gtaaaccgtt tagtagcaat taatagagct 420

gtctctgcta acgctaaatt gttagaacaa caaaaagctg ataaagtttc ccttgaagaa 480 

aagcaagctg ctaaccaaac agctattaat accattgccg ctaatatggc aatggctgaa 540 

gaaaaccaaa atacattacg tactcaacaa gctaatttgg aagctgcaac tgcaaattta 600 

gctctccaat tagcatctgc tactgaagat aaagctaatt tggtagctca aaaagaagct 660

gcagaaaaag ctgctgctga agccttagca caagaacagg ctgctaaagt taaggcacaa 720

gaacaggctg cacaacaagc agcatctgtt gaagcagcaa aatctgctat tactccagca 780 

ccacaagcta ctccggcagc gcaaagtagt aatgctattg aaccagctgc actcacggct 840

ccggcagctc cttctgcaag accacaaaca tcatatgatt cttctaatac ttatccagtt 900

ggacaatgca catggggagc taaatcttta gctccttggg caggaaataa ttggggaaat 960 

ggtggtcaat gggcttatag tgctcaagca gctggttatc gtactggttc aacgccgatg 1020

gtaggtgcga ttgccgtttg gaacgatggt ggttatggac atgtcgccgt tgtagttgag 1080 

gttcaaagtg cctcaagtat tcgtgtgatg gagtctaact acagtggtag acagtacatt 1140 

gctgatcacc gtggttggtt taatccaaca ggtgttacat ttatttatcc acactaa 1197 

<212> Type : DNA

<211> Length : 1197 

SequenceName : SEQ ID 635 
SequenceDescription : 

Sequence






<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString :

gtgattacaa ttaaaaatcc aaaaatcctt aagtggctaa agtatgtatt aagtgcaatt 60 

cttagcctta ttatccttgt tattattatt ggtggtcttt tgtttacctt ctacattagc 120
agtgctccga aactgtcaga agcccagtta aaatcaacaa actctagctt ggtttatgac 180

ggtaataaca atctgattgc tgatttgggt tctgaaaagc gtgaaaatgt aacagctgat 240 

agtatcccta ttaatctagt taatgctatt acctcaattg aagataaacg tttctttaac 300

catcgtggag tagatcttta tcgtattttt ggtgctgcct ttcataatct aacgagtcag 360

accactcaag gggggtcaac gcttgatcag caactcatta aactagccta tttttctact 420 

aatgaatctg atcaaacctt aaaacgtaag gctcaagaag tttggcttgc tcttcaaatg 480 

gagcgaaaat atactaaaca agaaatcctg actttttaca tcaacaaagt atatatgggt 540 

aatggcaact atggtatgct gacagccgct aagtcttatt atggcaagga tcttaaggat 600 

ttatcttatg cccagctagc cctattggct ggaatccctc aagctcctag tcaatatgat 660 

ccttaccttc atcctgaagc tgctcaaaat cgccgtaacg tcgtgttgca acagatgtac 720

atggaaaaac atctgacgaa agcagaatat gaaactgcca tcgcaactcc cgtcgctgaa 780 

ggtctacaat cactccaaca gcgctcaact tatccaaaat atatggataa ttatctaaaa 840 

caagttattg aagaagtcaa aaaggaaacg aataaagata tttttaccgc tggtttaaaa 900 

gtttatacca atattatccc cgatgcgcag cagactcttt ataatattta tcattctggt 960 

gattatgttt actatccaga ccaagatttc caagttgctt caacgattgt tgatgtgaca 1020

aatggtcatg ttattgctca gcttggcgga cgtaatcagg atgaaaatgt ttcatttggg 1080 

actaaccaag ctgttttaac tgatcgtgac tggggttcta ccatgaagcc aatcacagcc 1140 

tatgctcctg ctattgaatc tggtgtttat acttctactg ctcagtcgac taatgactca 1200 

gtctattatt ggcctggaac cactacccaa ttgtttaact gggaccttag atataacgga 1260 

tggatgacaa tccaagctgc tattatgcta tcgcgaaatg tcccagcagt ccgagcactg 1320

gaagccgcag gacttgacta tgctcgatct ttcttaagca gtttaggtat taactatccc 1380

gaaatgcact actcaaacgc tatctcaagt aataacagta gctcagataa aaaatatggt 1440 

gcaagtagtg aaaaaatggc cgctgcatac gctgcttttg caaatggtgg tatttatcat 1500 

aaaccaaggt atgtcaataa agtggaattt agtgatggta caagtaaaac atttgatgaa 1560 

aaaggaaaac gtgccatgaa agaaaccacg gcctatatga tgacagatat gttgaaaact 1620

gttctcactt atggtacagg tactgctgct gccattcctg gtgttgcgca agctggtaaa 1680 

acagggactt ctaactacac tgatgaggaa ctagctaaaa ttggtgaaaa atacggcctt 1740 

tatccagatt atgttggtac attagcgcca gacgaaaact ttgttggctt tactaagcgc 1800 

tacgccatgg ctgtttggac aggttacaaa aaccgcttga ccccagtata cggatcaagt 1860 

ctagagattg catctgacgt ttatcgtagc atgatgactt acttaacaaa tggttacagt 1920

gaagactgga ccatgccaaa tggtctttat cgcagtggtg gattcctcta cttaagcgga 1980 

acctatgcga gcaacaccga ctatactaat tcggtttaca acaatcttta cagcaataac 2040 
acaacaacag cttctagcca aacgacttca gatgatacta gtagtagcaa tgatacaagt 2100
aattcaacca atacagacaa caatggcagc catccatcta ccgatgataa aaagacaact 2160 
cattaa 2166 
<212> Type : DNA
<211> Length : 2166 

SequenceName : SEQ ID 636 

SequenceDescription :

Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString :

atgattatta ctaaaaagag cttatttgtg acaagtgtcg ctttgtcgtt agcacctttg 60 
gtgacagcgc aggcacaaga gtggacacca cgatcggtta cagaaatcaa gtctgaactc 120

gtcctagttg ataatgtttt tacttatact gtaaaatacg gtgacacttt aagcacaatt 180

gctgaagcaa tgggaattga tgtgcatgtc ttaggagata ttaatcatat tgctaatatt 240 

gacttaattt ttccagacac gatcctaaca gcaaactaca accaacacgg tcaggcaacg 300

actttgacgg ttcaagcacc tgcttctagt ccagctagcg ttagtcatgt acctagcagt 360 

gagccattac cccaagcatc tgccacctct caatcgactg ttcctatggc accatctgcg 420

acaccatctg atgttccaac gacaccatta gcatctgcaa agccagatag ttttgtgaca 480

gcgtcatctg agctcacatc gtcaacgaat gatgtttcga ctgagttgtc tagcgaatca 540 

caaaagcagc cagaagtatc acaagaagca gttccaactc ctaaagcagc tgaaacgact 600 

gaagtcgaac ctaagacaga catctcagaa gatccaactt cagctaatag gcctgttcct 660 

aacgagagtg cttcagaaga agcttcttct gcggccccag cacaagctcc agcagaaaaa 720 

gaagaaacct ctcagatgtt aactgcgcca gcggcacaaa aagctgtagc tgacaccaca 780

agtgttgcaa cctcaaacgg cctttcttac gctccaaacc atgcctacaa tccaatgaat 840 

gcagggcttc aaccacaaac agcagcgttc aaagaagaag tggcttctgc ctttggtatt 900 

acgtcgttta gtggttaccg tccaggagat ccaggagatc atggtaaagg attagccatt 960 

gactttatgg taccggttag ctctacgctt ggtgatcaag ttgctcaata tgccattgac 1020 

catatggcag agcgtggtat ttcatacgtt atttggaaac agcgattcta tgcgccattt 1080

gcaagtattt acggaccagc ctatacatgg aaccccatgc cagatcgcgg cagtattaca 1140 

gaaaaccatt atgatcatgt tcatgtctcc tttaatgctt aa 1182 
<212> Type : DNA 
<211> Length : 1182 

SequenceName : SEQ ID 637

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232
<400> PreSequenceString : 

atgaaaaaga aaattctttt aatgatgagt ttaatcagtg tcttttttgc ttggcaactt 60 

actcaggcaa aacaagtctt agcagagggt aaagtgaagg tggtgacaac tttctaccct 120 

gtttatgaat ttacaaaagg ggttattggt aatgatggcg atgtttccat gcttatgaaa 180 

gcaggaacgg aacctcatga ttttgagcct tctacaaaag acattaaaaa aatccaagat 240

gcagatgcat ttgtttatat ggatgacaat atggaaactt gggtttctga tgtgaaaaaa 300 

tcattgacat ctaaaaaagt gaccatcgtc aagggaactg gtaacatgct cttggtagca 360 

ggagctggac atgaccatca ccatgaggat gctgacaaaa agcatgagca taataaacat 420

agcgaagaag gacacaacca tgcttttgac ccacacgtgt ggttgtcacc ataccgtagc 480 

attacggtcg ttgaaaatat tcgcgacagt ctttcaaaag cttacccaga aaaagcagag 540

aacttcaaag ccaatgccgc tacttatatt gaaaaattaa aagagcttga caaagactat 600 

acggcagcac tttcagatgc taagcaaaag agctttgtga cacaacacgc agcttttggt 660 

tatatggcac ttgactatgg cttgaaccaa atttctatta atggtgtcac accagatgca 720 

gaaccatcag caaaacgtat tgctactttg tcaaaatacg ttaaaaaata tggcatcaaa 780 

tacatttatt ttgaggaaaa tgcgtcaagt aaagtcgcaa aaaccctagc taaagaagca 840

ggagttaaag cggctgtgct tagtccgctt gaaggtttga ctaaaaaaga gatgaaagct 900 

ggccaagatt actttacggt catgcgtaaa aaccttgaaa ccttacgctt aaccactgat 960 

gtggctggta aagaaattct tccagaaaaa gacacgacta agacagttta caatggttat 1020 

ttcaaagaca aagaagtcaa agatcgtcaa ttatctgact ggtcaggtag ctggcaatct 1080 

gtttacccct atttacaaga tggtacttta gaccaagttt gggactataa ggctaaaaaa 1140

tctaaaggta aaatgacagc agccgagtac aaagattact acactactgg ttataaaact 1200

gacgtggaac aaatcaaaat caatggtaag aaaaagacca tgacctttgt tcgtaatggt 1260 

gaaaagaaaa ccttcactta cacatacgcc ggcaaagaaa tcttgaccta tccaaaagga 1320 

aatcgcggag ttcgtttcat gtttgaagct aaagaaccaa atgctggcga attcaaatac 1380

gttcaattca gtgaccatgc cattgctcct gaaaaagcag agcatttcca cctgtactgg 1440

ggtggtgaca gccaagaaaa attacataaa gagttagaac attggccaac ttactacggt 1500

tcagacttat ctggtcgtga aattgcccaa gaaatcaatg ctcattaa 1548 
<212> Type : DNA

<211> Length : 1548 

SequenceName : SEQ ID 638 
SequenceDescription : 


Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232
<400> PreSequenceString : 

atgaaaaaat ttcatcgttt tttggtctca ggagtaatcc ttttaggttt taatggtcta 60

gtacctacta tgccatctac acttatttcg caacaggaaa atcttgttca tgcagctgtt 120

ttaggcgata actatccgag taagtggaaa aaaggcaatg gaatcgattc gtggaacatg 180 

tatatccgcc aatgcacttc ttttgcagct tttcgtttaa gctctgctaa tggttttcag 240 

ttacctaaag gctacggtaa tgcctgcacg tggggacata tcgegaaaaa tcagggttat 300

cctgtgaata agacaccaag cataggggct atcgcttggt ttgataaaaa cgcttatcag 360

tcaaatgctg cttacgatca tgtagcatgg gtagctgata tccgtggaga cactgtcact 420

atcgaagagt ataattacaa cgctggacaa ggccctgaaa gataccataa gcgtcaaatt 480 

ccaaaatctc aggtaagtgg ttatatccat tttaaagact tatcatctca gacaagtcat 540 

tcctacccaa gacaactaaa acacatttct caagcttcat ttgacccctc tggaacttat 600

cactttacaa ccagattacc agtcaaagga caaaccagta tcgatagccc tgatcttgct 660

tactatgaag caggtcaatc tgtttattac gataaagtcg tgactgctgg aggttataca 720

tggcttagct acctcagttt ttctggaaac cgacgctata ttcccattaa agagcccgca 780 

cagtctgtgg ttcaaaatga caatacaaaa ccttccatta aggtcggtga tactgttacc 840 

ttccctggcg tttttcgtgt agatcagctt gttaataatt tgatcgttaa taaagaatta 900

gccggaggag acccaactcc actaaactgg attgatccca caccattaga tgaaacagat 960

aaccaaggaa aagttttagg aaatcaaatt ctccgtgtgg gtgaatattt taccgtcact 1020

ggtagttata aagtattaaa aattgatcaa ccaagtaatg gtatttatgt tcaaatcgga 1080 

tctcgtggaa catgggtaaa tgctgataaa gctaacaaat tatag 1125 
<212> Type : DNA 

<211> Length : 1125

SequenceName : SEQ ID 639 
SequenceDescription : 



Sequence 


<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

atgttaaaat ttacttcaaa tattttagct actagtgtag ctgaaacaac tcaagttgct 60 
cctggaggct gctgttgctg ctgtactact tgttgcttct caattgctac tggaagtggt 120 
aattctcaag gtggtagcgg aagttatacg ccaggtaaat aa 162
<212> Type : DNA 
<211> Length : 162 

SequenceName : SEQ ID 640 

SequenceDescription : 


Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

atgggagaat cttattctgt tgaagcggtt ttgacagctg ttgataaaac ctttggcaaa 60

acattacaat cggcaatccg ttcaatcgaa ggcttggaaa agcgttcaac cggtttttca 120 

tcggtgtctc aaaaagctag ttccatgttt aaatccatgt taggagcgaa tttagctgga 180 

caagctatct cggcaatgac aaggacagtg tcatcaggcc ttggctctat gcttggcgag 240 

atgaatagtt cagcgaaagc gtggaaaact tttgacgcta atttagcgga cattgggttt 300

ggaaaaaaac aaattttggc agttaaaacg gcgatgcaag actatgcaac taaaacaatc 360

tactcggcat cagatatggc tagcacgtat gcacagttag cagcagttgg cgtgaaagat 420

accggaaagc tcgtaaaagc ttttggcggt ttagctgcat ctgctgaaaa tccgaagcag 480 

gctatgaaat caattagtca gcaaatgaca caagctgttg gaagaccaac agttgcatgg 540 

caagacttta ggataatgtt ggaacagacg cctgcaggga tggctaaagt cgctaaatct 600 

atgggtaaaa atcttgatga actcgtcgcc gatatccagg cgggtagggt taaaaccagc 660

gattttttgg aagcggtaaa aaaagcaggc aatgataaga gtttccaaaa gatggcaact 720 

gagttcaaaa ctgttgacca agccatcgac ggtatgcgag aaggcttatc caacaaattg 780 

caaccagcgt ttgaaaaagt gaaccaattt ggaattagag cgatcgaagc aatcggtaaa 840 

caactcgata aagttgattt ttctaagttt gctagtaatc ttgggaaatt ccttgaagga 900 

attaatatcg ataaaattgt atctaatatt tcatcggcgg tttcatctgt cacttcaaag 960

gttaaagaat tttgggacgg tttcaaacaa actggagcaa ttagtgcttt ttcaggagct 1020 

ttgcagagcg tttggggagc tttaaaaaat gtcgctagcg ccatgagcgg agggaattgg 1080
aagacttttg gagcaacagt tggagggatt gttaaacacg tctctaactt cgctaaagct 1140

gtttccgatg ttttaggaaa gatggaccct ggcagactaa gaagtfcggat agctaccttc 1200 

gccgcagtag ctggaggttt taagttattc gaaaaattaa cgggacaaag cgtcattggt 1260 

tcttttttgg ataaaattgg cagcaaattt ggtctctttg gaaacaaagc caaagaagga 1320

acagacaaag cctctaacgg cgctagaaga agcggtggca ttattagcca aatcttcagc 1380

ggcttgggta atatcgttaa gtctgctggt acagccatat caacagctgc aaaaggtatc 1440 

ggagttggta ttaaaactgc tttgtctgga atccccccct atcattag 1488 
<212> Type : DNA 
<211> Length : 1488 

SequenceName : SEQ ID 641 

SequenceDescription : 



Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

atgaaaaaag gtttttttct catggttatg gtcgtgagtt tagtaatgat agcagggtgt 60

gataagtcag caaaccccaa acagcctacg caaggcatgt cagttgtaac cagcttttac 120

ccaatgtatg cgatgacaaa agaagtatct ggagacctaa atgatgtgag gatgatccaa 180

tcaggtgcag gcattcattc ctttgaaccg tctgtaaatg atgtagcagc tatttatgac 240 

gcggatttgt ttgtttacca ttcacatacc ttagaagctt gggcaaggga tctagaccct 300

aatttaaaaa aatcaaaggt tgatgtgttt gaagcgtcaa aacctttgac actagataga 360

gtcaaagggc tagaagatat ggaagtcaca caaggcattg accctgcgac actttatgac 420

ccacatacct ggacggatcc cgttttagct ggtgaagaag ctgttaatat cgctaaagag 480

ctaggacgtt tggatcctaa acacaaagac agttacacta aaaatgctaa ggctttcaaa 540 

aaagaagcag agcaactaac tgaagaatac actcaaaaat ttaaaaaggt gcgctcaaaa 600

acattcgtga cgcaacacac ggcattttct tatctggcta aacgattcgg cttgaaacaa 660 

cttggtatct cgggtatttc tccagagcaa gagccctctc ctcgccaatt gaaagaaatt 720

caagactttg tcaaagaata caacgtcaag actatttttg cagaagacaa tgtcaatccc 780

aaaattgctc atgctattgc gaaatcaaca ggagctaaag taaagacatt aagtccactt 840 

gaagctgctc caagcggaaa caagacatat ctagaaaatc ttagagcaaa tttggaagtg 900

ctctatcaac agttgaagta a 921 
<212> Type : DNA 
<211> Length : 921 

SequenceName : SEQ ID 642 

SequenceDescription : 



Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

gtggagaaaa agcaacgttt ttcccttaga aaatacaaat caggaacgtt ttcggtctta 60 

ataggaagcg ttttcttgat gatgacaaca acagtagcag cagatgagct aagcacaatg 120

agtgaaccaa caatcacgaa tcacactcaa caacaagcgc aacatctcac caatacagag 180

ttgagctcag ctgaatcaaa atctcaagac acatcacaaa tcactcccaa gacaaatcgt 240

gaaaaagagc aaccacaagg tctagtctct gagccaacca caactgagct agctgacaca 300

gatgcagcac caatggctaa tacaggtcct gatgcgactc aaaaaagcgc ttctttaccg 360

ccagtcaata cagatgttca cgattgggta aaaaccaaag gagcttggga caagggatac 420

aaaggacaag gcaaggttgt cgcagttatt gacacaggga tcgatccggc ccatcaaagc 480 

atgcgcatca gtgatgtatc aactgctaaa gtaaaatcaa aagaagacat gctagcacgc 540

caaaaagccg ccggtattaa ttatgggagt tggataaatg ataaagttgt ttttgcacat 600 

aattatgtgg aaaatagcga taatatcaaa gaaaatcaat tcgaggattt tgatgaggac 660 

tgggaaaact ttgagtttga tgcagaggca gagccaaaag ccatcaaaaa acacaagatc 720

tatcgtcccc aatcaaccca ggcaccgaaa gaaactgtta tcaaaacaga agaaacagat 780

ggttcacatg atattgactg gacacaaaca gacgatgaca ccaaatacga gtcacacggt 840

atgcatgtga caggtattgt agccggtaat agcaaagaag ccgctgctac tggagaacgc 900

tttttaggaa ttgcaccaga ggcccaagtc atgttcatgc gtgtttttgc caacgacgtc 960 

atgggatcag ctgaatcact ctttatcaaa gctatcgaag atgccgtggc tttaggagca 1020 

gatgtgatca acctgagtct tggaaccgct aatggggcac agcttagtgg cagcaagcct 1080 

ctaatggaag caattgaaaa agctaaaaaa gccggtgtat cagttgttgt agcagcagga 1140 

aatgagcgcg tctatggatc tgaccatgat gatccattgg caataaatcc agactatggt 1200 

ttggtcggtt ctccctcaac aggtcgaaca ccaacatcag tggcagctat aaacagtaag 1260 

tgggtgattc aacgtctaat gacggtcaaa gaattagaaa accgtgccga tttaaaccat 1320 

ggtaaagcca tctattcaga gtctgtcgac tttaaaaaca taaaagatag cctaggttat 1380

gataaatcgc atcaatttgc ttatgtcaaa gagtcaactg atgcgggtta taaagcacaa 1440 

gacgttaaag ataaaattgc tttaattgaa cgtgatccca ataaaaccta tgacgaaatg 1500 

attgctttgg ctaagaaaca tggagccctg ggagtactta tttttaataa caagcctggt 1560 
caatcaaacc gctcaatgcg tctaacagct aatgggatgg ggataccatc tgctttcata 1620

tcgcacgaat ttggtaaggc catgtcccaa ttaaatggca atggtacagg aagtfctagag 1680 

tttgacagtg tggtctcaaa agcaccgagt caaaaaggca atgaaafcgaa tcatttttca 1740 

aattggggcc taacttctga tggctattta aaacctgaca ttactgcacc aggtggcgat 1800 

atctactcta cctataacga taaccactat ggtagccaaa caggaacaag tatggcctct 1860

cctcagattg ctggcgccag ccttttggtc aaacaatacc tagaaaagac tcagccaaac 1920

ttgccaaaag aaaaaattgc tgatatcgtt aagaacctat tgatgagcaa tgctcaaatt 1980 

catgttaatc cagagacaaa aacgaccacc tcaccgcgtc agcaaggggc aggattactt 2040 

aatattgacg gagctgtcac tagcggactt tatgtgacag gaaaagacaa ctatggcagt 2100 

atatcattag gcaacatcac agatacgatg acgtttgatg tgacfcgttca caacctaagc 2160

aataaagaca aaacattacg ttatgacaca gaattgctaa cagatcatgt agacccacaa 2220 

aagggccgct tcactttgac ttctcgctcc ttaaaaacgt accaaggagg agaagttaca 2280 

gtcccagcta atggaaaagt gactgtaagg gttaccatgg atgtctcaca gttcacaaaa 2340 

gagetaacaa aacagatgtc aaatggUtac tatctagaag gttttgtccg ctttagagat 2400

agtcaagatg accaactaaa tagagtaaac attccttttg ttggttttaa aggacaattt 2460

gaaaacttag cagttgcaga agagtccatt tacaggttaa aatctcaagg caaaaccggt 2520

ttttactttg atgaatcagg tccaaaagac gatatctatg tcggtaaaca ctttacagga 2580 

ctcgtcactc ttggttcaga gaccaatgtg tcaaccaaaa cgatttctga caatggtcta 2640 

cacacacttg gtacctttaa aaatgcagat ggcaaattta tcttagaaaa aaafcgcccaa 2700 

ggaaaccctg tcttagccat ttctccaaat ggtgataaca accaagattt tgcagccttc 2760

aaaggtgttt tcttgagaaa atatcaaggc ttaaaagcaa gtgtctacca tgctagtgac 2820

aaggaacaca aaaatccact gtgggtcagc ccagaaagct ttaaaggaga taaaaacttt 2880 

aatagtgaca ttagatttgc aaaatcaacg accctgttag gcacagcgtt ttctggaaaa 2940 

tcgttaacag gagctgaatt accagatggg tattatcatt atgtagtgtc ttattaccca 3000

gafcgtggtcg gtgccaaacg tcaagaaatg acatttgaca tgattttaga ccgacaaaaa 3060

ccggtactat cacaagcaac atttgatcct gaaacaaacc gattcaaacc agaaccccta 3120

aaagaccggg gattagctgg tgttcgcaaa gacagtgtct tttatctaga aagaaaagac 3180 

aacaagcctt atacagttac gataaacgat agctacaaat atgtctcagt agaagacaat 3240 

aaaacatttg tggagcgaca agctgatggc agctttatct tgccgcttga taaagcaaaa 3300

ttaggggatt tctattacat ggtcgaggat tttgcaggga acgtggccat cgctaagtta 3360

ggagatcacc taccacaaac attaggtaaa acaccaatta aacttaagct tacagacggt 3420

aattatcaga ccaaagaaac gcttaaagat aatcttgaaa tgacacagtc tgacacaggt 3480 

ctagtcacaa atcaagccca gctagcagtg gtgcaccgca atcagccgca aagccagcta 3540 

acaaagatga atcaggattt ctttatctca ccaaacgaag afcgggaataa agacttcgtg 3600 

gcctttaaag gcttgaaaaa taacgtatat aatgacttaa cggttaacgt atacgctaaa 3660

gatgaccacc aaaaacaaac ccctatctgg tctagtcaag caggcgctag tgcatcagct 3720 

attgaaagta cagcctggta tggcataaca gcccgaggaa gcaaggtgat gccaggtgat 3780 

tatcagtatg ttgtgactta tcgtgacgaa catggtaaag aacatcaaaa gcagtacacc 3840

atatctgtga atgacaaaaa accaatgatc actcagggac gttttgatac cattaatggc 3900

gttgaccact ttactcctga caagacaaaa gcccttggct catcaggcat tgtccgcgaa 3960

gaagtctttt acttggccaa gaaaaatggc cgtaaatttg atgtgacaga aggtaaagat 4020

ggtatcacag ttagtgacaa taagatgtat atccctaaaa atccagatgg ttcttacacc 4080 

atttcaaaaa gagatggtgt cacactgtca gattattact accttgtcga agatagagct 4140 

ggtaatgtgt cttttgctac cttgcgtgac ctaaaagcgg tcggaaaaga caaagcagta 4200

gttaactttg gattagactt accggtccct gaagacaaac aaatagtgaa ctttacttac 4260

cttgtgcggg atgcagatgg taaaccgatt gaaaacctag agtattataa taactcaggt 4320 

aacagtctta tcttgccata cggcaaatac acggtcgaat tgttgaccta tgacaccaat 4380 

gcagccaaac tagagtcaga taaaatcgtt tcctttacct tatcagctga taataacttc 4440 

caacaagtta cctttaagat gacgatgtta gcaacctctc aaataactgc ccactttgat 4500 

catcttttgc cagaaggcag tcgcgttagc cttaaaacag ctcaaggtca gctgatcccg 4560

cttgaacagt ccttgtatgt gcctaaagct tatggcaaaa ccgttcaaga aggcacttac 4620 

gaagttgttg tcagcctgcc taaaggctac cgtatcgaag gcaacacaaa ggtgaatacc 4680 

ctaccaaatg aagtgcacga actatcatta cgccttgtca aagtaggaga tgcctcagat 4740 

tcaactggcg atcataaggt tatgtcaaaa aataattcac aggctttgac agcctttgcc 4800

acaccaacca agacaacgac ctcagcaaca gcaaaagccc taccatcagc gggtgaaaaa 4860

atgggtctca agttgcgcat agtaggtctt gtgttactcg gacttacttg cgtctttagc 4920

cgaaaaaaat caaccaaaga ttga 4944 
<212> Type : DNA 
<211> Length : 4944 

SequenceName : SEQ ID 643

SequenceDescription : 

Sequence 



<213> OrganismName : Treponema pallidum subsp. pallidum str. Nichols
<400> PreSequenceString : 

atgatgcggt cacttttttc aggtgtgtct ggtatgcaga atcatcaaac gcgcatggat 60 
gtcattggga acaacgtcgc gaacgttaac actaccggtt ttaagcgtgg gcgtgttaat 120

tttcaagatc ttatttctca gcaactgagt gcggctgcgc gtccgaatga agaagttgga 180

ggagtgaatc ccaaggaagt gggattgggc gtgctgattg caagcatcga tactgttcac 240

acgcaaggtg cactgcaaac gacgggtatc aatacggatg tgtctattca ggggagtggt 300

ttttttgtgc tgaaaagtgg ggaaaagacg tttttcaccc gcgcaggtgc ctttggggtt 360

gataatgcgg gcactctcgt gaaccctgcg aatggtatgc gcgttcaagg ttggatggcg 420

caggacgtgg cgggggagcg tttaattaat tcctctgcac agacgcagga tctcgttatc 480

cccattgggc aaaagataga tgcgcagcag accagcactg ttcactatgc ctgtaattta 540

gacaagcgtc tgcctgagct tgctgcagat gcgaacgaag cggacgtgcg taagtccacg 600

tggacaactg actttcaagt gtatgatagc ttcgggcagc agcatacgtt gcagattaac 660

ttttcgcgtg tgccggggac gaacaatcag tggcaggcca ctgtcgcagt ggatccgggg 720

acagaggtag atacgcaaac gcgtgtaggg gtggggacat ctgacggtgc ggcaaacacc 780

tttattgtaa attttgataa ttttggacac ctcgcttcag tgactgacac tgcagggaac 840

gtgaccggtc ctaccggaca ggfcgctcctt gaagcgtcgt acgatgttgt cggtgcgaat 900

ccggacgatg cagggcaggt tacgcgccac gctttcacgc tcaacttggg tgaaattggc 960

accgcgcgca atacgattac gcagtttgct gaacgcagta ctaccaaagc ctaccggcag 1020

gacggttacg cgatgggata tttggaaaat tttaaaatag atcaaagcgg tgtcatcact 1080

ggtgtgtatt caaatggggt gagccaagac attggccagc tcgcacttgc aggatttgca 1140

aatcaaggtg gtcttgagaa ggcaggagag aacacctacg tacaatcgaa caactcaggg 1200

atagcgaaca ttagcacgtc gggggtgatg gggaagggaa agttgattgc agggacactt 1260

gagatgagca acgtagattt aaccgatcaa tttacggata tgatcattac ccaaaaaggg 1320

tttcaggcgg gcgcaaagac gattcagaca tcagacacca tgttggatac ggtgttgagt 1380

ttgaagcgct ga 1392
<212> Type : DNA
<211> Length : 1392

SequenceName : SEQ ID 644

SequenceDescription :



Sequence 

<213> OrganismName : Treponema pallidum subsp. pallidum str. Nichols 
<400> PreSequenceString :

atggggtgca tgcggtgggg gagtgtgctg tgtgtggtgg tgggggtagg agcgagcggg 60 

ggagtgctcg gacaggagtt ttccccgaag ctaactggct ctgccacact tgagtggggc 120 

atcagctatg gcaagggggt aggcagtcat ggccaggccc ctggtgcagt tatgggcacc 180

ggtccctaca atctgaagca cgggtttcgt actaccaaca cggtgggagt atcctttccc 240 

ctggttatgc gcaccaccca cacgcgccgt gggcagcacc cggcactgta tgcggagctg 300

aaggtggcgg acctgcaggc ggacctgagt caggggaagg caggttttgc cgttaagcgc 360

aaggggaagg tagaggcgac actacactgt tatggggcct acctgacgat tgggaagaac 420

cccacgtttc tgacgaactt tgcccggctg tggaagccgt gggtgacagc gcagtaccag 480

gaggatgcgg tacagtatgc gccggggttt gggggtttag gcggcaaggt tgggtatcgg 540 

gcacaggaca ttgggggcag tggggtcagc cttgatgtgg ggtttctctc ctttgcctct 600

aacggtgcct gggatagtac tgaccccacg cacagtaagt atggctttgg ggcagacttg 660 

aagctaatgt atgcgcgtgc aggacaccct ctgtgcacgg tagagcttgc cagcaatgtt 720 

acgctagaag acggatacct catcggtgca cagaaggacg caaacaatca gaacaaggat 780

aaactgctgt ggaatgtagg gggccgactc accctcgaac caggcgccgg cttccgcttc 840 

tccttcgccc tcgacgccgg taaccaacac cagagtgcac aggactttca aaatcgcaca 900 

cagagggcgc agagtgaact caccgccctc tcaaataacc tcttccaggg agaaagtcaa 960 

aaacaggaag cctgggtaac ccaggtagtg caacaggcga cgcagacagt aacggctgga 1020 

gttcgaagcg cgctggaatc tcgggggact acgtacataa acgcgctaga ggcagttcag 1080

cctaatcctg ctaaacctac cggtaaggtt gtgcaaaatc ttcacacccc gcagggaagt 1140

ccgccgaacc tgccgccgct tcctgcactt cctgcatttt ccctgatggg gcaggttttg 1200 

ctgcagtacg atgcggagca ggtggtgaag gggtttgagc aggtacagac gcaaatcgtc 1260

actgaaatta atcagaaagt gcaagcggct gtggcaaaaa ataatgcaaa catgcaagcg 1320 

gtcgggggta gtctaggcga tactgcgaga atggtaggcg aagcgctcat taagcagcaa 1380

ctatcacgta agcagaacag cattctgacc atggtgagcg tgcaagatga ggtgaaacag 1440

gatctggcag atttagtgcc gatgatgcga acggaaataa cggcgttttt cgcgagtgtc 1500 

cagcaacaca taaccgaaga agtgaagaag aagacggatg cgttgaatgc ggggcagcag 1560 

atacgtcagg ctatacagaa cctgcgtgcg tctgcatggc gtgcctttct aatgggagtc 1620 

agcgccgtgt gtctgtatct tgacacctac aatgtcgcct tcgatgcgct gtttacggcg 1680

cagtggaagt ggctgtcttc tggcatatac tttgccacag caccggcaaa cgtttttggc 1740 

accagggtgt tagataacac catcgcaagc tgtggcgact ttgccggatt ccttaagctc 1800

gaaactaaga gcggtgaccc ctacacccac ctgctcaccg gcctggacgc cggcgttgaa 1860 

acacgcgtgt acatccccct cacccatgac ctgtacaaaa ataataacgg gaaccctctc 1920

ccttccggcg gttcctcagg gcacattggc ctgccggtgg tggggaaggc gtggtgtagc 1980

tatcgcatcc cggtgcagga ttacggctgg gtgaagccaa gcgttacggt ccatgcctct 2040 

accaaccgtg cacacctgaa tgcccctgct gcaggtggag cagtaggagc tacctatcta 2100
accaaggagt actgtgcaca gctgcgtgct ggtatttcag ccagtctcat agagaagacg 2160

gtattctccc ttgattggga acagggtatg ctctctgatg tcccgtacct gctggtgtcc 2220

gagtgcctca cccagggaat cggccgcatc gtgtgcggcg tcaccctctc ctggtag 2277 

<212> Type : DNA

<211> Length : 2277 

SequenceName : SEQ ID 645 
SequenceDescription : 

Sequence



<213> OrganismName : Treponema pallidum subsp. pallidum str. Nichols 



<400> PreSequenceString : 

gtgggcaggc aggtgatgca agcgggggta cttgcgggca tggtatgtgc tgcttctggt 60

tatgcaggcg tactcactcc gcaggtcagt ggcacagccc agctccagtg gggcattgcg 120

ttccagaaga atccacgcac tggcccgggc aagcacaccc atgggtttcg cactaccaat 180

agtctgacta tttccctgcc gttggtgtca aagcacaccc acacccgccg aggggaggca 240 

cgctcagggg tgtgggcaca gctgcagctg aaggacctgg cagtagagct tgcgtcttct 300

aaaagctcaa cggccctgtc ctttaccaaa cctaccgctt ccttccaggc aaccctgcac 360

tgttatgggg cctacctgac agtgggtacc agtccttcct gtgtggttaa ctttgcccag 420

ctgtggaaac cctttgtcac ccgtgcctat tcagaaaagg acactcgcta tgcccctggt 480 

ttctccggct ccggggcaaa actcggctac caggcccaca atgtgggaaa cagcggagta 540 

gatgtggaca tcggtttcct ctccttcctt tccaatggtg cctgggatag tactgacacc 600

acgcacagca agtatggctt cggggccgat gcaacgcttt cctatggcgt cgaccgtcag 660 

cggctgctta cgttggagct ggcagggaat gccacactgg accagaacta cgttaagggt 720

accgaagact ccaagaacga aaacaaaaca gcactcctgt ggggagtagg aggccgactc 780 

accctcgaac caggcgccgg cttccgcttc tccttcgccc tcgacgccgg taaccaacac 840 

cagagtaacg cacatgctca gacccaagag agagctatcc tcaaagcaag ggaagtgttt 900 

agacgggtgg aggggaaact cgtgcagaac cttcccaata tcatgatgcc accaggaatc 960 

accgaacaaa ccactctcat agagatggta ggacttgctg ctttgattgc agaaggaacg 1020

ctcggcagcg ccattcaaac cgtgctagcc gctggcgcgc tcgcggcgct tgtatcgcaa 1080

cttgtaccga acatagagca aggagtacgt gatgtcttcc gctcttccga tccaagagtt 1140 

gtcactgcta aacttctcgc tttccttgag cgcgcaccta tgaacgcgct caacatagac 1200

gcgctcctgc gtatgcagtg gaagtggctc tcttctggca tatactttgc caccgcaggc 1260 

actaatatct ttggcaaacg cgtctttgct accactcgtg cgcactactt tgattttgcc 1320

ggattcctta agctcgaaac caaaagcggt gacccctaca cccacctgct caccggcctg 1380

aacgccggcg tcgaagcacg cgtgtacatc cccctcacct acatccgtta cagaaataac 1440 

ggagggtacg aactgaatgg agctgtgccc cctgggacta tcaatatgcc aattttgggg 1500

aaggcgtggt gcagctatcg catccccctc ggttcccacg cctggcttgc accacacaca 1560 

tccgtgctcg gcacaaccaa tcgctttaac attattaacc ccgcgggcaa cctgttgaat 1620

gaacgagcgc tccagtacca ggtgggactg acgttcagtc ccttcgagaa ggtggagctc 1680 

agcgcccagt gggaacaggg cgtgcttgct gacgctcctt acatgggcat tgccgagagc 1740 

atctggtccg aacgccactt cggcaccctt gtctgcggaa tgaaagtgac atggtaa 1797 



<212> Type : DNA

<211> Length : 1797 

SequenceName : SEQ ID 646 
SequenceDescription : 

Sequence



<213> OrganismName : Treponema pallidum subsp. pallidum str. Nichols 



<400> PreSequenceString :

gtgggcaggc aggtgatgca agcgggggta cttgcgggca tggtatgtgc tgcttctggt 60 

tatgcaggcg tactcactcc gcaggtcagt ggcacagccc agctccagtg gggcattgcg 120

ttccagaaga atccacgcac tggcccgggc aagcacaccc atgggtttcg cactaccaat 180 

agtctgacta tttccctgcc gttggtgtca aagcacaccc acacccgccg aggggaggca 240 

cgctcagggg tgtgggcaca gctgcagctg aaggacctgg cagtagagct tgcgtcttct 300

aaaagctcaa cggccctgtc ctttaccaaa cctaccgctt ccttccaggc aaccctgcac 360

tgttatgggg cctacctgac agtgggtacc agtccttcct gtgtggttaa ctttgcccag 420

ctgtggaaac cctttgtcac ccgtgcctat tcagaaaagg acactcgcta tgcccctggt 480 

ttctccggct ccggggcaaa actcggctac caggcccaca atgtgggaaa cagcggagta 540 

gatgtggaca tcggtttcct ctccttcctt tccaatggtg cctgggatag tactgacacc 600 

acgcacagca agtatggctt cggggccgat gcaacgcttt cctatggcgt cgaccgtcag 660 

cggctgctta cgttggagct ggcagggaat gccacactgg accagaacta cgttaagggt 720

accgaagact ccaagaacga aaacaaaaca gcactcctgt ggggagtagg aggccgactc 780 

accctcgaac caggcgccgg cttccgcttc tccttcgccc tcgacgccgg taaccaacac 840 
cagagtaacg cacatgctca gacccaagag agagctatcc tcaaagcaag ggaagtgttt 900

agacgggtgg aggggaaact cgtgcagaac cttcccaata tcatgatgcc accaggaatc 960 

accgaacaaa ccactctcat agagatggta ggacttgctg ctttgattgc agaaggaacg 1020

ctcggcagcg ccattcaaac cgtgctagcc gctggcgcgc tcgcggcgct tgtatcgcaa 1080

cttgtaccga acatagagca aggagtacgt gatgtcttcc gctcttccga tccaagagtt 1140

gtcactgcta aacttctcgc tttccttgag cgcgcaccta tgaacgcgct caacatagac 1200

gcgctcctgc gtatgcagtg gaagtggctc tcttctggca tatactttgc caccgcaggc 1260

actaatatct ttggcaaacg cgtctttgct accactcgtg cgcactactt tgattttgcc 1320 

ggattcctta agctcgaaac caaaagcggt gacccctaca cccacctgct caccggcctg 1380

aacgccggcg tcgaagcacg cgtgtacatc cccctcacct acatccgtta cagaaataac 1440 

ggagggtacg aactgaatgg agctgtgccc cctgggacta tcaatatgcc aattttgggg 1500

aaggcgtggt gcagctatcg catccccctc ggttcccacg cctggcttgc accacacaca 1560 

tccgtgctcg gcacaaccaa tcgctttaac attattaacc ccgcgggcaa cctgttgaat 1620 

gaacgagcgc tccagtacca ggtgggactg acgttcagtc ccttcgagaa ggtggagctc 1680

agcgcccagt gggaacaggg cgtgcttgct gacgctcctt acatgggcat tgccgagagc 1740

atctggtccg aacgccactt cggcaccctt gtctgcggaa tgaaagtgac atggtaa 1797 



<212> Type : DNA 

<211> Length : 1797 

SequenceName : SEQ ID 647 
SequenceDescription :



Sequence 



<213> OrganismName : SARS coronavirus Frankfurt 1
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120 

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180 

ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240 

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480 

ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540 

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840 

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020 

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080

ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140 

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380

tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440 

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500

tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560 

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680 

tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740 

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800 

gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860

atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980 

catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040 

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100 

tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160 

aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220 

ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340 

ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400 

gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460 
gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580

agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640 

gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700 

aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760 

acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880 

gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940 

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000

agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060 

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120 

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180 

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300

aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360

gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420 

catacatcac cagatgttga ttttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480 

attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540 

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600 

ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660 

agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720 

gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768 
<212> Type : DNA 
<211> Length : 3768 

SequenceName : SEQ ID 648 

SequenceDescription : 

Sequence 



<213> OrganismName : SARS coronavirus HSR 1
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180

ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240 

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480 

ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540 

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900 

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960 

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080

ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380

tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440 

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500

tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560 

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620 

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680

tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740 

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800 

gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860 

atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980 

catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040 

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100 

tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160 
aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220

ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340

ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400

gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460

gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580 

agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640 

gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700 

aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760

acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820 

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880 

gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940 

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000

agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180 

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300

aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360

gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420

catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480 

attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600

ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660

agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720

gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768
<212> Type : DNA 
<211> Length : 3768 

SequenceName : SEQ ID 649

SequenceDescription : 

Sequence



<213> OrganismName : SARS coronavirus 20*01
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120 

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180

ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300 

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360 

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480

ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960 

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080 

ttttcaacct ttaagtgtta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260 

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380

tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500 

tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560 

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680 

tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800

gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860 
atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980

catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040 

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100

tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160

aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220 

ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340 

ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400

gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460

gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520 

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580

agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640 

gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700

aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760

acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880

gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000

agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300

aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360

gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420

catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480

attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540 

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600

ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660

agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720

gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768
<212> Type : DNA 
<211> Length : 3768 

SequenceName : SEQ ID 650

SequenceDescription : 



Sequence 



<213> OrganismName : SARS coronavirus TW1
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180 

ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480 

ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900 

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960 

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080

ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380

tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500 

tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560 
aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680

tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800

gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860

atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980

catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100

tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160

aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220

ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340

ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400

gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460

gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580

agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640

gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700

aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760

acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880

gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000

agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240

ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300

aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360

gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420

catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480

attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600

ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660

agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720

gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768
<212> Type : DNA
<211> Length : 3768

SequenceName : SEQ ID 651

SequenceDescription :



Sequence 



<213> OrganismName : SARS coronavirus CUHK-Su10
<400> PreSequenceString :

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180

ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480

ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780

acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080

ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260
cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380

tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440 

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500 

tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680 

tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800 

gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860

atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980

catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100

tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160

aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220 

ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340 

ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400 

gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460

gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520 

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580

agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640 

gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700

aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760

acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820 

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880

gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940 

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000

agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120 

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300

aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360

gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420

catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480

attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600

ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660

agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720 

gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768 
<212> Type : DNA 
<211> Length : 3768 

SequenceName : SEQ ID 652

SequenceDescription : 



Sequence 



<213> OrganismName : SARS coronavirus Urbani
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120 

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180 

ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300 

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480 

ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900 

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960
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aacttgtgtc cttttggaga ggtttttaat 
gagagaaaaa aaatttctaa ttgtgttgct 
ttttcaacct ttaagtgcta tggcgtttct 
aatgtctatg cagattcttt tgtagtcaag 
5 caaactggtg ttattgctga ttataattat 
cttgcttgga atactaggaa cattgatgct 
aggtatctta gacatggcaa gcttaggccc 
tcccctgatg gcaaaccttg caccccacct 
tatggttttt acaccactac tggcattggc 

10 tttgaacttt taaatgcacc ggccacggtt 
aagaaccagt gtgtcaattt taattttaat 
tcttcaaaga gatttcaacc atttcaacaa 
tccgttcgag atcctaaaac atctgaaata 
gtaagtgtaa ttacacctgg aacaaatgct 

15 gttaactgca ctgatgtttc tacagcaatt 
atatattcta ctggaaacaa tgtattccag 
catgtcgaca cttcttatga gtgcgacatt 
catacagttt ctttattacg tagtactagc 
ttaggtgctg atagttcaat tgcttactct 

20 tcaattagca ttactacaga agtaatgcct 
aatatgtaca tctgcggaga ttctactgaa 
ttttgcacac aactaaatcg tgcactctca 
cgtgaagtgt tcgctcaagt caaacaaatg 
ggttttaatt tttcacaaat attacctgac 

25 gaggacttgc tctttaataa ggtgacactc 
gaatgcctag gtgatattaa tgctagagat 
acagtgttgc cacctctgct cactgatgat 
agtggtactg ccactgctgg atggacattt 
gctatgcaaa tggcatatag gttcaatggc 

30 aaccaaaaac aaatcgccaa ccaatttaac 
acaacaacat caactgcatt gggcaagctg 
ttaaacacac ttgttaaaca acttagctct 
gatatccttt cgcgacttga taaagtcgag 
ggcagacttc aaagccttca aacctatgta 

35 agggcttctg ctaatcttgc tgctactaaa 
agagttgact tttgtggaaa gggctaccac 
ggtgttgtct tcctacatgt cacgtatgtg 
ccagcaattt gtcatgaagg caaagcatac 
ggcacttctt ggtttattac acagaggaac 

40 aatacatttg tctcaggaaa ttgtgatgtc 
gatcctctgc aacctgagct cgactcattc 
catacatcac cagatgttga tcttggcgac 
attcaaaaag aaattgaccg cctcaatgag 
gaccttcaag aattgggaaa atatgagcaa 

45 ggcttcattg ctggactaat tgccatcgtc 
agttgttgca gttgcctcaa gggtgcatgc 
gatgactctg agccagttct caagggtgtc 
<212> Type : DNA 
<211> Length : 3768 

50 SeguenceNarae : SEQ ID 653 

SequenceDe script ion : 



gctactaaat tcccttctgt ccacgcaugg juu^u 

gattactctg tgctctacaa ctcaacattt 1080 

gccactaagt tgaatgatct ttgcttctcc 1140 

ggagatgatg taagacaaat agcgccagga 1200 

aaattgccag atgatttcat gggttgtgtc 12 60 

acttcaactg gtaattataa ttataaatat 1320 

tttgagagag acatatctaa tgtgcctttc 13 8 0 

gctcttaatt gttattggcc attaaatgat 1440 

taccaacctt acagagttgt agtactttct 1500 

tgtggaccaa aattatccac tgaccttatt 1560 

ggactcactg gtactggtgt gttaactcct 1620 

tttggccgtg atgtttctga tttcactgat 1680 

ttagacattt caccttgctc ttttgggggt 1740 

tcatctgaag, ttgcfcgttct atatcaagat 18-00 

catgcagatc aactcacacc agcttggcgc 18 60 

actcaagcag gctgtcttat aggagctgag 1920 

cctattggag ctggcatttg tgctagttac 1980 

caaaaatcta ttgtggctta tactatgtct 2040 

aataacacca ttgctatacc tactaacttt 2100 

gtttctatgg ctaaaacctc cgtagattgt 2160 

tgtgctaatt tgcttctcca atatggtagc 2220 

ggtattgctg ctgaacagga tcgcaacaca 2280

tacaaaaccc caactttgaa atattttggt 2340 

cctctaaagc caactaagag gtcttttatt 2400 

gctgatgctg gcttcatgaa gcaatatggc 2460 

ctcatttgtg cgcagaagtt caatggactt 2520 

atgattgctg cctacactgc tgctctagtt 2580 

ggtgctggcg ctgctcttca aatacctttt 2640 

attggagtta cccaaaatgt tctctatgag 2700 

aaggcgatta gtcaaattca agaatcactt 2760 

caagacgttg ttaaccagaa tgctcaagca 2820

aattttggtg caatttcaag tgtgctaaat 2880 

gcggaggtac aaattgacag gttaattaca 2940 

acacaacaac taatcagggc tgctgaaatc 3000

atgtctgagt gtgttcttgg acaatcaaaa 3060

cttatgtcct tcccacaagc agccccgcat 3120 

ccatcccagg agaggaactt caccacagcg 3180 

ttccctcgtg aaggtgtttt tgtgtttaat 3240

ttcttttctc cacaaataat tactacagac 3300 

gttattggca tcattaacaa cacagtttat 3360 

aaagaagagc tggacaagta cttcaaaaat 3420 

atttcaggca ttaacgcttc tgtcgtcaac 3480 

gtcgctaaaa atttaaatga atcactcatt 3540 

tatattaaat ggccttggta tgtttggctc 3600 

atggttacaa tcttgctttg ttgcatgact 3660

tcttgtggtt cttgctgcaa gtttgatgag 3720 

aaattacatt acacataa 3768 



Sequence 



<213> OrganismName : SARS coronavirus
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60
acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120
tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180
ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240
atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300
tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360
actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420
tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480
ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540
cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600
caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660
aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720
gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780
acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840
aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900
cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960
aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020
gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080
ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140
aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200
caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260
cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320
aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380
tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440
tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500
tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560
aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620
tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680
tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgcgc ttttgggggt 1740
gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800
gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860
atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920
catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980
catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040
ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100
tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160
aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220
ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280
cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340
ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400
gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460
gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520
acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580
agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640
gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700
aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760
acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820
ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880
gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940
ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000
agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060
agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120
ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180
ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240
ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300
aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360
gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420
catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480
attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540
gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600
ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660
agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720
gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768
<212> Type : DNA
<211> Length : 3768 

SequenceName : SEQ ID 654

SequenceDescription : 

Sequence 



<213> OrganismName : SARS coronavirus TOR2
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60
acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120
tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180
ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240
atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300
tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360
actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420
tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480
ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540
cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600
caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660
aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720
gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780
acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840
aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900
cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960
aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020
gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080
ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140
aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200
caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260
cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320
aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380
tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440
tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500
tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560
aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620
tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680
tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgcgc ttttgggggt 1740
gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800
gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860
atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920
catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980
catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040
ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100
tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160
aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220
ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280
cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340
ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400
gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460
gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520
acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580
agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640
gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700
aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760
acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820
ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880
gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940
ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000
agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060
agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120
ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180
ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240
ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300
aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360
gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420
catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480
attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540
gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600
ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660
agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720
gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768
<212> Type : DNA 
<211> Length : 3768 
SequenceName : SEQ ID 655

SequenceDescription : 

Sequence 






<213> OrganismName : SARS coronavirus GD01 
<400> PreSequenceString :

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60
acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120
tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180
ccattttatt ctaatgttac agggtttcat actattaatc atacgtttga caaccctgtc 240
atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300
tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360
actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420
tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480
ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540
cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600
caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acccattttt 660
aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc ctttttacct 720
gctcaagaca cttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780
acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840
aatccacttg ctgaactcaa atgatctgtt aagagctttg agattgacaa aggaatttac 900
cagacctcta atttcagggt tgttccctca agagatgttg tgagattccc taatattaca 960
aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020
gagaggaaaa gaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080
ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140
aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200
caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260
cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320
aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380
tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440
tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500
tatgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560
aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620
tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680
tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740
gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800
gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860
atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920
catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980
catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040
ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100
tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160
aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220
ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280
cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa agattttggt 2340
ggttttaatt tttcacaaat attacctgac cctctaaagt caactaagag gtcttttatt 2400
gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460
gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520
acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580
agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640
gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700
aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760
acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820
ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880
gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940
ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000
agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060
agagttgact tttgcggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120
ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180
ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240
ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300
aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360
gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420
catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480
attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540
gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600
ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660
agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720
gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768
<212> Type : DNA
<211> Length : 3768

SequenceName : SEQ ID 656

SequenceDescription :

<213> OrganismName : SARS coronavirus CUHK-W1
<400> PreSequenceString :

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60
acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120
tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180
ccattttatt ctaatgttac agggtttcat actattaatc atacgtttga caaccctgtc 240
atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300
tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360
actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420
tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480
ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540
cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600
caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660
aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720
gctcaagaca cttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780
acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840
aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900
cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960
aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020
gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080
ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140
aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200
caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260
cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320
aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380
tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440
tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500
tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560
aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620
tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680
tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740
gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800
gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860
atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920
catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980
catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040
ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100
tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160
aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220
ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280
cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340
ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400
gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460
gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520
acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580
agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640
gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700
aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760
acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820
ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880
gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940
ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000
agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060
agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120
ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180
ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240
ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300
aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360
gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420
catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480
attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540
gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600
ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660
agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720
gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768
<212> Type : DNA

<211> Length : 3768 

SequenceName : SEQ ID 657

SequenceDescription :



Sequence 

<213> OrganismName : SARS coronavirus BJ01 
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60
acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120
tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180
ccattttatt ctaatgttac agggtttcat actattaatc atacgtttga caaccctgtc 240
atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300
tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360
actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420
tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480
ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540
cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600
caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660
aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720
gctcaagaca cttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780
acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840
aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900
cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960
aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020
gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080
ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140
aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200
caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260
cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320
aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380
tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440
tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500
tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560
aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620
tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680
tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740
gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800
gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860
atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920
catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980
catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040
ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100
tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160
aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220
ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280
cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340
ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400
gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460
gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520
acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580
agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640
gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700
aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760
acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820
ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880
gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940
ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000
agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060
agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120
ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180
ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240
ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300
aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360
gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420
catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480
attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540
gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600
ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660
agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720
gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768
<212> Type : DNA 
<211> Length : 3768

SequenceName : SEQ ID 658 

SequenceDescription : 



Sequence 

<213> OrganismName : Escherichia coli O157:H7
<400> PreSequenceString :

atgaacaaaa tatttaaagt tatctggaac 
gaaacggcaa aaagccgtgg caagaaatct 
gttcgcggtg gaatgttgtc gtcgtttggg
cagggtgttg attacggtag tggatcagct 
gcgaaagcaa atacttttat gaacaccagt
atagctgaag gccaatatag ctctgccatc 
tcaatggcct ttggggttag tgcaatatca 
tcttcgtatt cattgggcca atactcaatg 
aaattgtcta ttgctatggg ggactcttcc 

ggaaatgcca ctaaagctac tgagattatg
tcaaaagcgt attcaatggc gctgggagca 
gcgataggtg ctgagaccga agccgctgaa 
gcaaaaggga ctaatagcat ggcaatgggg 
atcgcattag gaaatggcag ccaggctctg

aacaaagctg atggcgtgga tgccatcgct
aacaccattg ccttaggcac agccagtaat
agtaatagca gtgccaacgg tattaactct
ttagacaata ccgtctctgt cggcaatagt
aatggcgcga tcaagtctga cagttacgat

agcgactcgg tagcaaaaag gcttggagga
gttacagcac caacctacaa tttaaaaaat
gctcyfcacttg atgaaaacac cctgcaatgg 
catggtacta gtagcccaac tgccagcgta
gcctccagta aggatgcggt taacggttcc 

gccaacaccg ccaatatcgc tactaatacc
gccaccaata ccaccaatat caccaacctg 
gccctgctct ggaacgaaac taaaaaggca
agcaaaatca ccaacgttaa agatgccgac 
ggctctcagc tgaaaaccac caacgatgct 

aacacttcca atattgccac taacaccacc
aatcttggtg aggatgcgct gaaatgggat 
ggcaccgaga ccaccagcaa aatcaccaac 
accgatgccg ttaacggctc tcagctgaaa
accaatatcg ccactaacac caccaacatc 

ggtgaggatg cgctgaaatg ggataaggac
aataccgcca gcaaaatcac caatatcctg 
gccattaacg gtagccagct ttatgactta
aatgjcttctg tgaatactga cggtgtgttt 
aattattata acgtcggcga tgcactggct 

ggcgatgctc tgctttggga tgccaccgca
ggtgacgcaa gcgtgatcac tgatgtcgca 
gcagtaaacg gctcacaact ccacggcgtg 
ggtgccgaag tcaatgcaga cggcaccatc 
gattacgata atgtcggtga tgccctgaat 

ctctgggatg cggacgccgg tgaaaatggt
actgccagtg taatcactaa cgtcgctaac 
attaacggct cacaactcta taccaccaat 
gcagaagtca acgctgacgg caccatcacc 
tacaacaacg tcggtgacgc cctggatgcg 

actgccaatg gcggtgctgg agcctacaat
actaatgtcg ctaatggcag tattagtgag 
ttgaatgcga cgaatatgat gattgagcag 



coli 0157:H7 



cctgcgacag ggaattatac tgttaccagc 60 

gggcgcagta agctgttaat ttctgcgctg 120

gcattggcga atgccgggaa tgacaacggt 180

ggcgacggct gggttgctat aggcaaaggg 240 

ggttccagta ctgctgtggg ttatgacgct 300

gggtcaaaaa cccatgcgat tggtggtgca 360 

gaaggcgata gaagtatagc actgggtgcc 420

gccctcggcc gttattcaaa agcattgggt 480 

aaagcggaag gagcaaacgc cattgccctg 540 

agtattgctc ttggcgacac cgccaatgcg 600

agtagcgtcg catctgaaga aaacgctatt 660 

aatgcaactg ctattggcaa taatgcgaag 720

ttcggaagcc ttgccgataa agtcaatact 780

gcagataatg caatcgccat aggccagggc 840 

ctgggtaatg gtagccagtc gagaggctta 900 

gcaactggtg ataagagtct tgcgcttggt 960 

gtcgcgctgg gcgcagattc cattgcggat 1020

tcattaaaac gcaagatcgt taatgtgaaa 1080

gccattaatg gttcacagct ttatgccatt 1140 

ggggctgcag tagatgttga tgacggtact 1200 

ggtagcaaaa ataacgtagg ggctgcgctc 1260 

gaccaaacca aaggcaaata cagcgctgct 1320 

atcaccgatg ttgcggatgg cacgatttca 1380

caactgaaag ctaccaatga cgatgtcgaa 1440 

agcaacattg ccacgaatac ggcaaatatt 1500 

acggattccg ttggtgacct tcaggctgat 1560 

ttcagtgcag ctcacggcca ggataccacc 1620

ctgacggctg acagcactga tgctgttaac 1680 

gtggcgacga ataccaccaa tatcgccaat 1740 

aacatctcta acctgactga gacggtgact 1800

aaggacaatg gtgtattcac ggcagctcat 1860 

gttaaagatg gcgacctgac gactggcagc 1920

accaccaacg atgccgtggc gacgaatacc 1980 

tctaatctga ctgagacggt gactaatctt 2040

aatggtgtct tcactgcagc tcatggcaac 2100 

gacggcacag tcactgcaac cagttccgat 2160 

agcagcaata tcgccaccta cttcggcggc 2220 

accggtccaa cctacaaaat cggtgaaaca 2280 

gcgattaact cctcatttag cacgtctctc 2340 

ggtaaattca gtgccaaaca cggtactaat 2400

gatggtgaaa tttcagactc cagttctgac 2460 

agcagttatg ttgttgatgc gctggggggt 2520 

actgcgccga cgtacaccat tgctaatgct 2580

gctatcgata ccactcttga cgacgctctg 2640 

gcatttagcg ccgctcacgg aaaagataaa 2700 

ggtgcaatct ctgctgccag cagcgacgcg 2760 

aagtacatcg ctgatgcgct gggtggtgac 2820 

gcaccgactt acaccattgc gaacgccgag 2880

cttgatgata acgccctgct gtgggatgag 2940 

gccagccatg acggtaaagc cagcatcatc 3000

gacagtaccg atgcagtgaa cggttctcag 3060

aacacccaaa ttatcaatca gctcgctggt 3120 
aacaccgacg caacctatat ccaagaaaac
gacgacggct tagcgttcaa cgacgccagc 
tataactctg tcgccaaagg cgatagcagc 
gttgatacgg gtatcgccct gggtagcagc 
tcccgtgaca ccagcataac ggaaaatggc
gaactgctcg gtgcattgtc tatcggtgat 
gccgatggtt ccgaagccca tgacgccgtt 
gcggtcgcaa ccacgccgac taaatacttc 
gcagtgggaa ctgactcgct ggcaatgggt 

attggtatcg gttatggtgc ctacgtggac
agcaatgcgc aagtcattca tgtcaacagt 
cgtggcgctc aaaccaatta taccgcctac 
gaattctcag tcggtagtgc ggatggtcaa 
gctgataccg atgcggtcaa cgtgggtgag

aatacccaga gcattactaa cctggataat
aatatcgaaa acggtattgg cgatatcgtc 
aataccgatg gtgtagatgc cagcgcgcag 
tccattgctg ccgctgacaa cagcgtcgct 
aatacgatct ctgtaggttc ctctactaac 

aaaaatgcta ccgatgctgt taacgtggca
cgttacgaca ccaaagctga tggttctatc 
aacggcggta cgactcgtat cagcaacgtc 
aattacgcgc agttgaagca aagcgtgcag 
gttgagatgg ataacaaact gtctaaaact 

gcaatggcaa tgaccggtct gccgcaggct
ggtggcggta cttacaacgg tgaatcggca 
aatggtcgtt gggtctacaa attacaaggt 
gcactcggtg ccggtattca gtggtaa 
<212> Type : DNA 

<211> Length : 4767

SequenceName : SEQ ID 659 
SequenceDescription :



ggtgcgggta ttaactatgt gcgtactaac 3180 

gcacagggtg ttggcgctac agctataggt 3240 

gtagctattg gtcagggcag ctacagcgac 3300 

tctgtttcca gccgagtgat tgccaaaggc 3360

gttgttattg gttacgacac cacggatggc 3420 

gacggtaaat atcgtcaaat catcaacgta 3480

acggttcgtc aattgcagaa tgcgattggt 3540 

cacgctaatt caacggaaga agattcactg 3600

gcgaaaacca tcgtgaatgg cgataaaggt 3660 

gcgaatgcac ttaacggcat tgccattggt 3720 

attgcgatag gtaatggttc tacgaccact 3780 

aacatggacg caccgcagaa ctctgtcggt 3840

cgtcagatca ctaacgtcgc agcaggttcg 3900 

ttgaaagtaa cggatgcgca ggtttcccag 3960 

cgggtaacga atcttgattc acgcgtcacc 4020 

accaccggta gcaccaagta cttcaagacc 4080

ggtaaagata gcgtcgcgat tggttccggc 4140 

ctgggtacag ggtctgtggc aaccgaagaa 4200 

caacgtcgta tcaccaacgt agctgcaggt 4260 

cagttgaagt cttccgaagc tggcggtgta 4320 

gactatagca atatcaccct cggtggcggc 4380

tccgctggcg tcaacaacaa cgacgtggtg 4440 

gaaacgaagc aatacaccga tcagcgaatg 4500 

gaaagcaagt tgagcggtgg tatcgcttct 4560 

tacactccag gtgccagcat ggcctctatt 4620

gttgctttag gtgtatcgat ggtgagcgcc 4680 

agtaccaata gccagggtga atactccgcc 4740 

4767 






Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

gtgccagctt ccgcagtagg tgcgctgggc gaagccagct acacggtgac ggcgaacgtc 60

accgacagcg caggcaacag caattccgcc agccataacg tgcaggtcaa taccgcgctg 120

cctggcgtca ccattaaccc agttgcgacc gacgatatta ttaacgccgc cgaatcgggc 180

aatgcgcaaa ccatcagcgg ccaggtgacg ggggcggcgg cgggcgatac ggttaccgta 240

acgcttggcg ggaaaactta caccgctacc gtgcagggga atttaagctg gagcgtggac 300

gttccggcgg cggatattca ggccatcggc aatggcaatc tgacggttaa cgcttcggtg 360

accaacggcg ttggcaatac tggcagcggt tcgcgagata ttactatcga cgccaacctg 420

ccaggtctgc gcgttgacac cgtggcgggc gatgatgtgg tcaatagcat cgagcacgct 480

caggcgctgg tgataactgg tagcagcagc gggctggcgg cgggcgcggc gctgacggtg 540

gtgattaaca cggtcactta cgctgcaaca gtattagccg atggcacatg gagcgttggt 600

gttccggcgg cagacgtgag taactggcct gcgggtacgg tgaatatcac ggtaagtggc 660

actaacacag ccggaacaac atccaccatc acccatccgg tcaccgtcga tctggcggcg 720

gtggcgattt ccattaacac cgtttccggc gacgatgtga ttaacgccgc cgaaaaaggg 780

gcagatttaa ccctttccgg cagcacctcc ggcgtggaag tggggcaaac ggtcaccgtt 840

acctttggcg ggaaaaccta caccgctacc gtagcgggcg atggtagctg gacaaccacc 900 

gtacccgccg ccgatctcag cgtgttacgc gacggcgacg ccaccgtgca ggccagcgtc 960 

agcactatta acggcaacac ggcttcggca acccacgcct acagcgtcga tgccacggcc 1020 

ccgacgcttg ccattaacac catcgccacc gacgatattc tgaacgctgc cgaggcgggc 1080

aatccgttaa ccatcagcgg tagcagcacc gccgaagcgg ggcagacggt aaccgtcacg 1140 

cttaatggtg tgacttacag cggctccgtc caggcggacg gcagctggag cgtcagctta 1200 

ccgacggcgg atctcagcaa tctgaccgcc agccagtaca ccgttagtgc ctcggtaagc 1260 

gataaagcgg gtaacccggc gtccgctaac cacgggctgg cggtggatct caccgtgccg 1320 

gtgctgacca tcaacaccgt ctccggcgat gacattatta acgccgccga acacggacag 1380

gcgctggtga tctccggctc cagcactggc ggcgaagcgg gtgatgtcat caccgtcaca 1440 

ctaaacagta aaacctacac caccatgctg gacgcttccg gcaactggag cgtcggcgtt 1500 

ccggcggctg acgtcactgc gcttggcagc ggcccgcaaa ctatcactgc ggcaattacc 1560 

gacgcggcag gcaacagcga tgacgccagc cgcacggtca ccgtgaatct cgccgcgcca 1620 

accattggta tcaacaccat cgccaccgat gacgtgatta aagccacgga aaaaggcgcg 1680

gacctgcaaa tcaccggcac cagtaatcag cctgcgggca ccaccattac ggtgacgctg 1740 

aacgggcaaa attacaccgc tactaccgat agcaacggca actggagcgc cacggtgcca 1800 
gcgtcagcgg
acggcaggca 
gttaccatta 
caaaccatca 
ggcggcaaca
gcggcggata 
ggcgtcggca 
ctgcgggtcg 
ctggtgatca 
aacaacgtta
gcggtagacg 
agtgccggaa 
atcaccatta 
ctggtccjtaa 
ggcggcaaaa
cctgccgjatc 
attaaccggca 
gtgaccatca 
atcaccatca 
aacaacacgt
gcggatfctaa 
gcgggtaacc 
accattaaca 
gtggtcagcg 
ggtaaaaact
gcggatgftca 
gcgggcaaca 
accattaaca 
accatcagcg 
ggcggcaatc
gcgaacgttt
tccgcaaata 
ctgcgcgtcg 
ttggtgafcca 
aatagcgtca
gtggcacjatg 
accgcccpcjaa 
atcaccatca 
ttgcagcrtct 
ggcggcaaaa
gccgtccjatg 
gtcgcaggga 
gttaccatca 
ctgaccatca 
ggcgttaatt
ggcgatttag 
gccagaaatt 
accatcaaca 
atcatcagcg 
acgaccacct
agcgtgattt 
gcaggcaaca 
gccattaaca 
gcgattaccg 
caaaattaca
cgggtga^cg 
ggcaacagcg 
attaacgtgg 
atcagcgggc 
gcgacttaca
gcgctacagg 
ggcaatactg 
gtcgacaccg 
attaccgcjca 
caaacctatg
gatgtcacgcg 
ggaaatccgg 



ttagcgcatt 
acagtaattc 
acgcggtggc 
gcgggcaggt 
cctacaccgc 
ttcaggcgct 



acaccggcag 
ataccgtggc 
ccggcagcag 
cttatggcgc 
tcagcaactg 
caacctccac 
acactctctc 
gcggcagcac 
attacaccac 
tcgctgcgct 
acagcgccca 
acaccatcgc 
gcggcactac
accagaccac 
gcggattaac 
cggcaagcgc 
ccgtcgcggg 
gcaccagtac 
acaccaccac 
cggcgctggc 
gcgacagcac 
ccgtctccgg 
gtcaggtcac 
aatataacgc 
tgcaggcgct 
ataccggcac 
ataccgtggc 
ccggtagcag 
cctacggcgc 
tcacaaactg 
ccacaaccag 
acaccctttc 
ccggcaccac 
gctacaccac 
ttgcaacctt 
acagcaccca 
acaccatcgc 
gcggcaccag
acagcggcaa 
ctagccttac 
cggcttcggc 
cggtggcggg 
gctcggcaac 
ataccaccgt 
ccgcgctggc 
gcggcacggc 
ccattgccgt 
gcaccagcaa 
ccaccactgc 
ccctcggtga 
gttccgccag 
tggcaacgga 
aggtgacggg 
ctgccacggt 
agttgggcaa 
gtaacggcac 
tggcgggcga 
gcagctccgg 
ttgcggcggt 
cctggcctgc 
taagcgttac 



gggtgaagcc 

cgccagtcat 
gactgacgat 
gacgggcgca 
cacggtgcag 
gggcaacggc 
cggctcgcgc 

gggcgatgat 

cagcggatta 
gacggtatta 
gccagcgggt 
cattacccat 



tggtgatgat 
cagcggtgtc 
cacagtggaa 
accggacggc 
ggccgatcgc 
cagcgacgat 
cacggcgcag 
cgttctggcg 
cgccagcagt 
cgaccacgcg 
cgatgacatt 
gggcgcggcg 
gctggatgcc 
gaccggtagc 
gactcacgat 
cgatgacatt 
gggaacggcg 
caccgtgcag 
gggtaacggt 
cgcgacgcac 
aggcgatgat 
tagcggactg 
aacggtttta 
gcctgcgggg 
cattagccat 
cactgacgat 
ctccggcgtg 
cacggttgcc 
gccagacggc 
ggcaacgcat 
cacggacgat 
cacggcggaa 
tgtgcaggca 
cgccagctcg 
aacgcataat 
cgatgacatc 
gggcgcgact 
gctggacgcc 
gcagggcgat 
ctcgcacact 
cgatgacatc 
ccagcctgcg 
cgatgcttcc 
agccacctac 
ccataacgta 
cgatattatt 
tgcggcggca 
gcaggcgaat 
cggcgaactg 
gcgcgaaatc 
tgatgtggtt 
tctggcggcg 
gctggcggat 
gggatcggtg 
gcatccggtg 



aactacacgg 
aatgtgctgg 
attattaacg 
gcgcaggggg 
tctaatttaa 
gacctgacgg 
gatatcacga 
gttattaaca 
acggcgggaa 
gccgacggca 
acggtgaata 
ccggttaccg 
gtgatcaacg 
gaagccgggc
gctaacggta 
gcgggcaacg 
gcgtatagcg 
atccttaacg 
gccgggcaga 
gatggcacct 
tacaccgtga 
ctggtggtag 
atcaacgcca 
gcgggtgatg 
tccggtaact 
cagaccatca 
gtgaccgttg 
atcaacgccg 
gttgccggga 
tcagatttaa 
gaactgacca 
gatatcgtga 
gtgattaaca 
gcggcgggcg 
gcggatggta 
acggtcaata 
ccggtcacgg 
gtgattaacg 
gaagcggggc 
gcggataata 
gcagcaaacg 
gcttacagcg 
attcttaacg 
gccgggcaga 
gacgggagct 
tataccgtta 
ctgacggtgg 
attaacgcca 
accggtaata 
aacggcaact 
gtgaccatta 
gtcaccgtgg 
atcaacgccg 
ggcacgcaga 
ggtaactgga 
acggtgaccg 
caggttaata 
aacgccgccg 
ggcgacacgg 
ttaagctgga 
accatttcgg 
accatagacg 
aatattatcg 
ggcagcaacg 
ggcacctgga 
acgattgcgg 
acggtcgatc 



taacggcaaa 



tcaacagcgc 
ctgccgaatc 
atacggttac 
gctggagtgt 
ttaatgcctc 
tcgacgccaa 
gcattgagca 
cggcgttaac 
cgtggagcct 
tcacggtaag 
tcgatctggc 
ccgtcgaaaa 
agacggtgac 
gctggacggt 
tgcaggcgag 
ttgatgccac 
tgagcgaagc 
cgctcaccgt 
ggagcgtgaa 
ccgccacggt 
atatcactgc 
tcgaacatgg 
tggtaaccgt 
ggagcgtggg 
ccgccagcct 
atcttagcgg 
ctgaaatcgt 
atacggtgat 
gctggagcgt 
tcagcgcctc 
tagatgccaa 
gcatcgagca 
cggcgttgac 
gctggagcgt 
ttgccgtctc 
tcgatctcgc 
ccgccgagaa 
aaaccatcac 
cctgggggct 
tccaggccag 
ttgatgccac 
ccgccgaagc 
cggtgaccgt 
ggagcgtcag 
acgcctcggt 
accttgccgc 
cggaacacgg 
cggtttccgt 
ggagcatcgg 
ccgctacggt 
cgctcggcgc 
cggagaaagg 
ttaccgttac 
gcgtgaccgt 
cagccgccac 
ccgcgctgcc 
aagcgggcgt 
tgaccgtcac 
gcgtcgatgt 
cttcggtgac 
cgaatctccc 
agcacgggca 
tcacgctgac 
gcgtcggcgt 
cgagcggtag 
tctcggcggt 



cgtcactgat 
cttgcctgcc 
gggaaatgcg 
cgttacgctg 
ggacgttccg 
ggtcaccaat 
tctgcctggc 
caatcaggcg 
ggtcgagatt 
cggtgttccg 
cggcaccaac 
tggggtcgcc 
aggcgaaacg 
cgtcaccttt 
gaatgtgccg 
tgtcagtaat 
cgcgccgctt 
tggcgcgggg 
cacgctcaat 
tgttccggca 
gagcgacaaa 
gccggatctc 
tcaggcgttg 
cacgctgaac 
cattccggcg 
gagcgatcgc 
cccgacgctg 
tgtggcgcag 
tgtcaccatt 
cagcgtaccg 
gttgaccaat 
cctgccaggt 
cactcaggcg 

ggtggttatc 
tggtgttccg 
tggcaccaac 
tgccgtggcg 
aggctctgat 
cgttatcttc 
gacgatccct 
cgtcagcaat 
tgcgccctcc 
gggatcggcg 
aacgctaaac 
cgtaccaacc 
cagcgacaaa 
tccggtcgtc 
acaggcgcag 
gacgattggc 
cgtgcctgcc 
caccgactcc 
tccggtgctc 
cgcggatctg 
gctcaacggg 
tccggcgtca 
tgacgccgat 
gggcgtcacc 
ggaacagacc 
gctcggcggg 
tccggcctcc 
gaacagcgta 
cggtctgcgg 
ggcgctggtg 
cattaacggg 
tccggcggtg 
cacctctgcc 
ggcggtgagc 
atcaacgcca ttaccgccga tgatgtgatc
ctctccggca gcacctctgg cgttgaagcc 
aaaacttaca gcgccacggt ggctgcgaat 
gatatggcgg ctctgcgtga tggcgatgcc 
ggcaacagcg ccaccacgac ccacgcttac
attaatacca ttgcgggcga tgatattctt 
atcaccggca gcagcacggc ggaagcgggg 
aactacaccg gcaccgtaca gacggacggc 
ttaagcaccc tgaccgccag caactacacc 
aacccggcct cggttaatca caacctgacg
aacacggtgg caggcgatga tgtgatcaac 
agcggctccg ccactggagc ggcaaccggt 
acctttacca cggtgctgga tgccagcggc 
gtctcggcac tggcgaatgg cacggtgacc
aacagcggca gcgctaccca tcaggtgacg
aacgcoatca gcggcgataa catcctgaac 
agcggcggca gtacggggct ggcgacgggc 
aactacagcg ccaccaccga cgcatcgggc 
ctggcggcat taggtcaggc caactatacg 

aacaccgcca gcagccaggc gaatttactg
aacaccgtgg caggcgacga tattatcaac 
agcggggtgg tgactcgcgc cgccgctggc 
acttacaccg ctacggtaca gagcaactta 
ctccaggcgt tgggcaatgg tgatttgacc 

aacaccggga gcggcacgcg ggatatcacc
gataccgtgg cgggcgatga tatcgtcaac 
accggcggca gtagcggcct gaatgcaggt 
gcgtattccg ccacggtgca ggcggacgga 
gtcagcgcct ggcctgcggg gccgttaacc 

aacccagtca gcgtcagcca tccgttcacc
aacaccgttg ccagcgacga cgtgattaac 
tccggoagta ccagcgggat tgagagcggg 
acctacactg craagcgtcgc cgccaacggg 
ctggcaactc tgccagaggg cgcggcgaat 

aacagfcgcct cggcgaccca tgcgtatagc
aacaccatcg ccagcgacga tatccttaac 
agcggcacca gcaccgccga aaccgggcag
tacaccggca ctgtgcaggc ggacggtagc 
ggcgccjctca acgcaagcaa ttacaccgtc 

cccggcagcg ccagccataa tctggcggta
accgtgjgcgg gcgatgacat catcaacgat 
ggcaccagta gcggcgggga agcgggcgat 
tacaccacca ccctggatgc ctccggcaac 
acggccjctgg gtagcggtgc gcagaccatc 

agcgacgacg ccagccgcac cgtgaccgtc
accatcgcgg gcgatgatgt gatcaacgcg 
ggcaccagcg atcagcctgc gggtacggcg 
agcgccacca cggatgcctc cggcaactgg 
gcgctgjggcg aagcgaccta cagcgtgacg 

agcaccgcca gccataacgt gcaggttaat
gtggcgacgg acgatattat caacgcttca 
caggtaaccg gtgcggcggc gggcagcacc 
accgccaccg tccaggctga tttaagctgg 
gctctcjgjgta acggtgagtt aacggttaat 

ggtagcggca cgcgcgatat caccatcgat
gtcgcgggtg atgatgtggt caatatcatc 
agcagctccg gctttgccgc aggcacggcg 
gccgccacgg tgttggcgaa cggtagctgg 
aactggcctg cgggaacgct gaatattacc 

accagcatta cccatccgct gactgttgat
atcaccagcg atgatgcgat taacgccgcc 
agcacgtccg gcgtcgaagc ggggcaaacc 
accaccacgg tggcggcaaa cggtagctgg 
gccctcjcgtg atggcgatgc cagcgcccag 

gccacagcaa cgcacgaata cagcgtcgat
attgccagcg ataacatcat caacgccagc 
accagcaccg cgcaaaccgg gcagacgctc 



aacgctgccg aaaaaggcgc ggcgttaacg 5880

ggacaaacgg ttaccgtcac ctttggcggc 5940 

ggttcctgga gcacctcggt tccggcggca 6000 

agcgcacagg ccagcgtcag caatgttaac 6060 

agcgttgatg ccagcgcgcc aacggtgacc 6120

aacgccgccg aagccggagc ggctctgacc 6180

cagacggtga ccgtcacgct caatggcaca 6240

agctggagcg tcagcgtacc gtcagccgac 6300

gtgaacgcgg cggtgagcga caaagccgga 6360

gtggatacgt ccgttccggt cgtcaccatc 6420

gcgacggaac acgcccaggc gcagatcatc 6480

agcaccgtga cggtgactat cggcacaaat 6540 

aactggagcg tcggcgttcc ggcaagcgtc 6600 

atcaatgcca gcgtcaccga tgccggagga 6660

gtcaataccg ggctgccgac cattaccttt 6720

gccgatgaaa aaggccagcc gttgaccatc 6780

gcgcaggtca ccgtcacgct caacggtcac 6840

aactggacct taaccgtgcc ggtgagcgat 6900

gtcagcgcca gcgccaccag tgcagcaggc 6960

gtcgacagcg gcctgccgga cgtcaccatc 7020

gccgccgaag cgggggccga tcaaaccatc 7080

gatacggtca ccgtgacgct gggcgggaac 7140 

agctggagcg tcagcgttcc gacagccgat 7200 

attaccgcct cggtcaccaa cgctaatggc 7260 

attgatgcca acctgccggg gctgcgcgta 7320

agcatcgagc acgggcaggc gctggtgatc 7380

gctgtgctga cggttaccat caacagtgtg 7440 

agctggagcg ttggcattcc ggcggcaaac 7500 

gtggaggtag acgggcaaag cagcgccaat 7560 

gtcgatttaa cggcggtggc aatcagcatc 7620

gccgcagaaa aaggcaccaa tctgactctt 7680 

caaaccgtca ccgtcacttt tggcggtaaa 7740 

agctggagtg taaacgttcc ggcggcagat 7800 

gtgcaggcca gcgttagcag cgcgagcggt 7860 

gttgacgcca gcgcgccgac gctcaccatt 7920 

gccgcagaag ccggaagccc gctcaccatc 7980

acggtgaccg tcacccttaa cggcgcaacc 8040 

tggagcgtca gcgttcccac ttcagccctg 8100

agcgccacgg tcaatgacaa agcgggcaac 8160 

gacaccaccg cgccggttct caccattaac 8220 

gccgaacatg cgcaggcgct ggtgatctcc 8280 

gtggtgagcg ttgtgctcaa cggcaaaacc 8340 

tggagcgttg gcgttccggc ggcagatgtt 8400

accgccagcg tcagcgatcg ggcaggcaac 8460

agcctcagcg cgccagtgat tagcatcaac 8520

acggaaaaag gatctgatct ggcgctttct 8580

atcaccgtca ccctgaacgg acaaaactac 8640

agtgttaccg tgcctgcctc ggcggtcagt 8700 

gcgagcgtca ccaatgctca gggtaacagc 8760 

accgcgctgc cgggcatcac cattaatccg 8820 

gaggcaggca gcgcgcaaac tatcagcggc 8880 

gtcaccgttg aactgggcgg taaaacttac 8940

aatgtgagcg tacctgccgc cgactggcag 9000

gcctcggtga ccaacgccgt tggcaacacc 9060

gccagtctgc ctggcctgcg ggtggatacc 9120 

gaacacgctc aggcgcaggt gatcaccggc 9180 

ttgacggtgg tgattaacaa ccaaacttac 9240 

agcgtcggcg ttcctgcgac ggatgtcagt 9300

gttagcgggg cgaacagtgc cggaacgcaa 9360

ctcaccgccg tcgccatcag catgaacagc 9420

gaaaaaggcg cggcgttaac gctctccggc 9480

gtcaccgtca cctttggcgg caaaacctac 9540

agcaccaccg ttccggcggc ggatttggcg 9600

gtgcgggtga ctaacgtcaa cggcaacagc 9660 

agcgccgcgc caacggtgac catcaacacc 9720 

gaagcggcgg cgggcgtaac ggtgtccggc 9780 

accgtcacgc ttaacggcac taactaccag 9840 
accacggtgc agacagacgg cagctggagt
cttgctaata acggctacac cctgaccgcc 
agcgccagca aaggcgtgac cgtcgatacc 
gcgggcgatg atgtgattaa caacgtcgaa 
gccacgggcg cggtggcggg cgaccgcctg
accagcaccg atgccagcgg caactggagc 
cttgcggatg gcacggtgac catcagcgcc 
acgcagacgc acaacgtaca ggtgaacacg 
agcggcgata accttattaa cgccgccgaa 
ggcactaatt tcgcgacagg tacagtggtg
gccaccattc agagcaacgg gagctggagc 
ctcagtgatg gcaccagcta cacggttagc 
aacagcagca cgcagacgca caacgtacag 
agcactatca gcggcgatsa ccttattaac 
agcggcactg gcactaattt cgcgacaggt
ggctacagcg ccaccattca gagcaacggg 
gtfcgcggcac tcagtgatgg caccagctac 
ggaaacagcg ccacggcctc gcgcagcgtg 
attaacaccg tttcgacgga cgaccgcctc 
cttaacggct cgaccagcgc ggaagtgggg
acctataccg ccacggttgc cgccaatggt 
ctggccgcgc tcgggcaggg cgcgcagacc 
aaccccggac aggccacgca cgccctgacc 
gccacggtgg cgggtgacga tattatcaac 
agcggcacca ccaccgccga agtgggccag
tggagcgcaa cggttggcag cggcggaagc 
gccggattaa gcgacggcag ctacaccatt 
cccggcagcg ccagccgtgg cgtgacgctg 
acctttgcgg gcgacgatgt ggtgaatgca 
ggcaccacca ccgcgcccgt cgggcagacg
accaccacgg tgcagacggg cggtagctgg 
gcgctggcgg acggcaacgc ctacgtgatt 
accggtagca gtaatcacac cattaccgtc
atcgattccc tgcaagccga cactggcctt 
ccgcjtagtgg tcaacggctc gctcaccgcc
agtatcgacg gcggcacaac ctggaccacg 
aacgacagcc gcacgctgac cgatggcaac 
gcgggcaacg ttggcgcgac cgacagccag 
cccgcggtga aaaccatcgc catcagcgcg 
gafcfcttgtca ccagcgacac gacgcttgcg
gccggtgagt tcgcgcaaat cagcctcgac 
gttggcacca gctggagcta tgccgatggt 
acggtgcggg tggtggatct ggcggggaac 
gtcgacacca ccagcccgga agcggcgaaa 
accggaacca gcagcagcga ttttattacc
ttgcggcgcgg cgctcggcgc taatgagttc 
tgggtgaacg tgaccgtcgc cgcagacagc 
ctcaccaacg gcaccaccac ctggcaggtg 
gcaacgagca gccagtcggc gctgatcgat 
gccagcatca gcaccgacac ggggagttcg
ctcacgctga ccggttcgct gggggcgggg 
cttgatagcg gcgcgacctg gacaacgctc 
gacagccgca cgctgaccga cggcagctac 
gggaacaccg gcccggtggt gtcgaaaacg 
acaccaacga ttgtgtcgta taccgatgat
tcgcaggcca ccgacgacac tacgccgctg 
agcggtgaag tggtttacct ctaccgtaac 
ggcgctctga actggaccta cagcgacagc 
gcgcgagtgg tggatttggc ggggaatatc 
gatacctcta ttccgaccac gctggcgcaa
ccgattatta gcggggtgat caccgccgcg 
atcaacggca aaacctacac ctctgaaccg 
aacacctggt atgtacagtt gccggatacc
accgttactg cgcaggtaaa aagttcggcg 
ggcacggtga cggttaacgc ggcgattgat
accaccgcct gggggctgac ctacggcctc 
aaccagcagg taatgcaatc gactgaccca 



ttaacgctgc ccgccagcga cttaaccgca 9900 

acggtcagcg atctggcggg taaccttggc 9960 

actgcgccgg tgatcagttt taacaccgtg 10020 

cacattcagg cgcagattat cagcggcacc 10080 

gtggtgacca tcgccgggca gcagtatgtc 10140 

gtcggcgtgc ctgccagcgt gatttccggc 10200

accattaccg atagcgcagg caacagcagc 10260 

gcggcagtgt cgctctcggt cagcactatc 10320 

gcgggcagtg cgctgaccct gagcggcact 10380

accgtgttgc ttaacggcaa aggctacagc 10440 

gtgaacgtgc ctgcggcgga tgttgcggca 10500 

gcctccgctc aggacagtgc cggaaacggc 10560 

gtgaacacgg cggcagtgtc gctctcggtc 10620 
gccgccgaag cgggcagtgc gctgaccctg 10680

acagtggtga ccgtgttgct taacggcaaa 10740 

agctggagcg tgaacgtgcc tgcggcggat 10800 

acggttagcg cctccgctca ggacagtgcc 10860 

gcggtggatc tcaccgcgcc ggttatcagc 10920

aacgccgccg aacagcagca gccgttaacc 10980

cagacggtca ccgtcacctt tggcggcaaa 11040 

acctgggcgc tgaacgtgcc tgcggtggat 11100 

attaccgcca gcgtgaacga tcgcgccggt 11160 

gtcgataccg ttgcgccaac ggtcaccatc 11220

aacgccgagc agcttgccgg gcagaccatt 11280

acggtgaccg ttacctttaa cgggcaaacc 11340 

tggtcggtgt ttattccggc gcagcagttt 11400 

agcgcgacgg ttagcgatca ggccggaaac 11460 

aacggcgatg tacctactgt caccatcaac 11520 

gcggaacacg gctcatcgct ggtgatcagc 11580 

ctgaccctga ccttaaacgg caaaacctac 11640 

agctacaccc tcggcagcgc cgatgtcacc 11700 

aacgcctcgg tgagcaatgc cattggcaac 11760 

gatctcagcg ctccggcgat gggcattaat 11820 

agcgccagcg actttatcac cagcgtcagt 11880 

gcgcttgcca gtaacgagac ggcgcaaata 11940 

cttaccgtta ccggcacgac ctggcgctat 12000 

tatctctatc aggtgcgggt gattgacgca 12060 

aatgtggtga tcgacactac cgcgccagat 12120 

atcaccaccg atatgggctt gatcactaac 12180 

gtgagcggca cgctgggggc gacgctttct 12240 

ggcggcgtca cctggactac gttaaccgtc 12300

cacacgctta ctgacggcac ctggaactac 12360

gttgggcaga ccgcaacgca aaacgtggtg 12420 

agtatcacca ttaccggtat cagcgatgac 12480 

agcgacacca cgcttaccgt gcgcggcgta 12540 

gcgcaaatca gtaccgacaa cggcgcaacc 12600

ctgaactgga gttacgttga cggacgaacc 12660 

cgggtggtcg atctggcggg caacgttggc 12720 

accgttaacc cggcgcaggt gctcaccatc 12780 

gcaactgact ttatcaccag cgacaccatg 12840 

cttgccagcg gcgaagtggc gcagattagc 12900 

accaccaacg gtacacagtg gacttacacc 12960 

gtttatcagg tgcgggtgct ggatctggcg 13020 

gtggtggtcg atacgattaa ccccaccgcc 13080 

gtcgggcagc ggcaggggac attaagcagt 13140 

ctgaacggtg tactttccgc gccgcttgcc 13200 

gggctgctgt taggggcggt gacgatggtc 13260 

gggctggtga gcggtgccta tacctacagc 13320 

acctcctcca gtgattttgt cctgacggtc 13380

atcaccagcc agaccacgcg cgatactacg 13440 

cttgccagtg ggcagtatgt tgaagtggtg 13500 

ggcggcgcgg tagtggtcga tccggcgcac 13560 

gatgcgctga cagtttccgc gaccgcctat 13620 

ggtaacggca ataacgccaa tattagcaac 13680 

tacacaccga cctggactac cgccagcaaa 13740 

gactcgcacg ggatgtggac ggtgctggca 13800 

ctcacctggt cgaagaccgc gctgacgctg 13860 
tatcagagcg gcaacaacta cgccaccagc
ggcgatctgt ttatcacccg tgatgactac 
aacggcgatg gcaccttttc cagcgctatt 
ggctcgattg tggcatfctga taaagagggc 
gctggcgggc cggactccaa caccttcctg 
tccaccacgt cgaacagcgg cggtagcgcc 
tcgctcaacg aaggttctgg cgtcgatctg 
cacacctata acctgaacaa ctattacacg 
acgtttgtct gggggcsgaa caccaccaat 
atgagcagca gcgtttccat gacctgggcc 
ttcctgcccg ccagccaggg aagagctaac 
gtactgggtt gcccggtggc ggtgggcgca 
ctggcggtgg actggaacca cgacggcctg 
cagtcgtatc ttjbatactaa cgtcagcaac. 
ggcagccaga gcggtaccac cagcggcgtg 
gtggatgtgc tggtgtccaa acagtcgggc 
gtgagctacg gcacttcgct acacctgcgc 
tacggcaata ccgtgaa.cjct gtacaactcg 
aacccgcagt cgggtafcggg ggttaacgac 
aatgccggag aaacctacaa cgcggtgctg 
atcgaccaga cggtcaacac cagctggggc 
tacgatctca gcgctgaagc gggtaccgcg 
tataacgaca ccttctfccgc caccgcaggc 
gtgtacagct ccggcaccgg aacgtggctg 
cggctttcga cggtggg-cgt gacggctaac 
tttaacacct cgacatttac caatatcgaa 
ctgaccggca gcagcggcga taaccaactg 
atcggcaacg gcggccacga caccttgctc 
ggcggcaacg gctcagacgt ggtgaatggc 
gacaccgatc gcattgatat tcgtgaactg 
aaagccagct acgtcaacgg cgtggcaacg 
tttgtcaaag tcacccagag cggcagcgac 
ggcggcactt ttgcgacaac taacgtggtc 
accttgctgg cgaatcatca gttgatggtg 
<212> Type : DNA
<211> Length : 15876

SequenceName : SEQ ID 660
SequenceDescription :

Sequence 



tccattgccg 
ggtacgggct 
caggtcaccg 
gacggctatc 
tggaacaacg 
acggtgggcg 
aacaatgacg 
ctgtcttcgc 
accttcctga 
gatttcgatg 
tacggctcgc 
acggcaacca 
atggatatcg 
gccagcaact 
gcggcaatgg 
agcgtgttcc 
atcaccgatc 
gcgggagtgc 
acctcggcgc 
atcaaatcca 
ggtttacagg 
agcaacaacg 
accgacactt 
gcgaacggcg 
ttaagcagca 
ggcatttccg 
gaagggcgcg 
tataaactgc 
tttacggtag 
ttacagggca 
ctggatgcgc 
accatcgtgc 
acgctgacgg 
gtgtag 



attacgaccg 
atattaacgg 
tcggcaccct 
tcgacttctg 
caggcacgct 
gggcggtgac 
gcaggatcga 
tcatcaacca 
gcggggcggg 
gtgacggcga 
tgttattcaa 
cctacgccag 
cccgtatcgc 
ggacgcaatc
actacgactg 
tgagccgcaa 
ccaacggcat 
tggtcgccac 
tggtcaactt 
ccggcaccac 
ccaccgatgc 
gcaagttcgt 
acgacggttc 
ggatggacgt 
ctgccgcgca 
gctcgaattt 
gcggcaacga 
tcaacgccag 
ggacatggga 
gcggctacac 
aggccggaaa 
agatcgaccg 
gcgtgcacaa 



taacggcacg 
ctttaccaat 
gacgtggtac 
gattggtgac 
ggtaggcaac 
ggggtatctt 
cctggttcag 
ggggaatggg 
cagtggcgct 
tatggatctc 
caccaacggc 
ccagtttagc 
ccagaccggg 
ggccctcggc 
ggacggcgcg 
caccaacacg 
taacgtctat 
gcaaatcatc 
ctacgggctg 
cgccagcaat 
cactcacgct 
cggcaccggc 
cggcggctgg 
ggttgatttc 
ggccaccggt 
taacgacatt 
cacgctcaac 
cgacgccact 
aggcaccgcc 
cggcaacggc 
catcggtgac 
cgacggcacg 
cgacctcgcc 






<213> OrganismName : Escherichia coli 0157:H7
<400> PreSequenceString :
ttgggtgttc atacagccga agccacactg
aatattgcgc ccgatgccag caacgcgcag
gtgacgaata acagcgacag cgtgcagctg
ccggtggcgg gaataaccjgt gaacttcacc
ctcgaaaata acggtattgc catcacccag
ggtaaaaaag cgggtac^ca tacggttacc
tcacagccgg taacgtttgt ggcggacaaa
aaaaatgaga tcacaggtaa tggcgtcgat
cagtfccgaca atgaggtgaa caatcttccg
accctgaccc caggggaaag taataccaat
ggcgttgcct ttggtgagca gacggtcact
aacaaaactg tgcattttat tggcgacaca
gtcccagaca gcataatcgc aggtaccccg
acagtcgttg ataataatgg ctttccggtg
gcagcgacag ccgaaatgac gaatggcggt
accgtcactt ataccaatac ccgctcctcg
gaggccagtc tggaaaatgg tagctccacg
gcgtctacgg cacatctcac cttgctacag
actaccaatc tgtatattga ggtgaaggat
gtaaccctca gcgtttcacc aagtgaaggt
accaatcacg acggcaatfct ttacgcaagc
gtgacggcaa ccctcgaaaa tggcgattcg
gtagcgaatg ctgaaatctc gctggcagcc
gatctcacga cactaacagc aacagtcgct



cctaacggca 
gttacgctga 
acggcgacgg 
atgccacagg 
gccaatgggg 
gcaacgctga 
acctcggctc 
agcgcaacgc 
gtaacattca 
gagtctggca 
gcatcactgg 
gcggcggcaa 
cagaacagct 
aaaggtgtga 
caagccgtga 
atagaatcag 
cttagcacat 
gcactttttg 
aattacggca 
gtgaccccca 
tttaccgcta 
atgcaacaaa 
tcgaaggatc 
gatacagagg 



acaacgatac 
acatccctgc 
tgaaagatcc 
acgtggcggc 
aagcgcatgt 
gtaataacaa 
tggttgttct 
taactgcaac 
gcacagcttc 
tcgcgcaggc 
ctaataatgg 
aaattatcga 
ccggcagcgt 
ctgtgaactt 
cgaacgaaca 
gagcgagacc 
caattaatgt 
atacagtctc 
acggagtacc 
gtaataacgc 
caaaagccgg 
cagtgaccta 
cggtaattgc 
gcaatgcgat 



gaagatagtc 
tcaacaggtg 
gtcgaatcat 
aaactttacc 
cacgctcaaa 
taccagtgat 
tcagatatca 
ggtcaaagat 
ttcaggcctc 
cactctcgca 
tgccagcgac 
gttgacgcct 
catcaccgcc 
caccagcaac 
gggtaaggct 
ggataccgtt 
caacgctgat 
cgcaggcgac 
ccagcaggag 
tatatatacg 
ggtataccaa 
tgtgccgaac 
caacaataac 
agccaacagt 



13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15876 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
gaggtaacat ttactctgcc ggaagatgtg agggcgaact tcacgctggg cgatggcggt 1500

aaagtggtta ctgatactga aggcaaagcg aaagtcacgc tgaaaggtac aaaagcaggc 1560

gctcatactg ttacagcatc gatggctggc ggtaagagtg agcagttggt ggtgaacttt 1620

attgcggata cactcactgc gcaggttaat cttaacgtta ccgaggacaa ttttatcgct 1680

aataacgtcg ggatgaccag gctgcaggca acagtgactg atggaaacgg caacccgtta 1740

gccaatgagg cggtgacatt cacgctaccg gcagatgtga gcgcaagctt tactctcgga 1800

caaggcggtt ccgccattac tgacatcaac ggcaaggctg aagttacact gagcggtaca 1860 

aaatccggca cctaccccgt gacagttagc gtgaacaatt atggtgtcag tgatacgaaa 1920 

caggtgactt tgattgccga tgctggtacc gcaaaactag cctccttaac ctctgtatac 1980 

tcattcgtcg tcagcacgac cgagggcgcg accatgactg caagcgtcac tgacgctaac 2040

ggcaacccgg tagaaggtat aaaagttaat ttccgcggaa cttccgtcac gctaagcagc 2100 

accagcgttg aaacggatga tcggggtttc gctgaaattc ttgtgacaag caccgaggtc 2160 

ggactgaaaa cagtttcagc ctctctggca gataaaccta ctgaagtcat ctcgcgatta 2220

ctgaatgcaa aagcagatat taattctgca acgattacca gtctggagat acctgaaggt 2280

caggtcatgg tcgcacaaga cgtagcagtt aaagctcacg tcaacgacca gtttggcaat 2340

ccgattctta atgaatctgt aacattcagt gcagaaccac cagagcacat gaccatcagc 2400

caaaatattg tctctactga tacgcatggt atagccgagg tcactatgac gcccgaaaga 2460

aacggttcgt atatggtgaa agcatccctg gcgaatggat cctcttatga gaaggatctg 2520 

gtggtaatcg atcaaaaact gacactctcg gcgtccagcc cgcttatcgg tgtcaattcc 2580 

ccaacaggtg caactctgac ggcaacgcta acttctgcaa atggcactcc agtggagggt 2640

caggtcatca actttagcgt aacgccagaa ggtgcgacgt taagtggcgg aaaagtgaga 2700

accaactctt caggtcaggc tccagtcgtt ctgaccagca ataaagtcgg tacatatacg 2760

gtgactgcat cgttccataa cggcgtaaca atacagacac agacaatcgt gaaagtcact 2820

ggcaactcaa gcaccgccca tgttgctagc tttatcgctg atccatcgac tatagccgcc 2880

accaacagtg atttaagtac cttaaaggca acggttgagg atggcagtgg taacctgatc 2940

gaaggtctca ctgtgtactt cgccttaaaa agcggctctg ccacattaac gtcattaaca 3000

gcggtgacag atcaaaacgg aatcgcgaca acaagcgtga gaggagcgat aacggggagc 3060

gtcacggtaa gcgcagtcac gaccgctggt ggaatgcaaa cagtagatat aacgctggtg 3120 

gcaggcccgg cagacgcctc gcagtccgtc cttaagaaca atcggtcatc attgaaagga 3180 

gactttaccg atagtgctga gctacatctt gttctgcacg atatatcagg caatccgatc 3240

aaagtttctg aagggctgga atttgtgcag tcaggtacca acgcgcccta tgtgcaagtt 3300

agtgcaattg actacagtaa aaatttctca ggcgagtaca aagccactgt tacaggcggc 3360

ggagagggta tcgcaacgct gatccctgta ttgaatggtg ttcatcaagc gggtctgagt 3420

accacaatac aattcactcg cgcagaagac aaaataatga gcggtacagt gttagtcaat 3480

ggtgctaacc taccgacaac tacattccct tcgcaggggt tcactggggc gtattatcag 3540

ttgaataatg acaactttgc cccaggaaaa acggcggctg attatgagtt ttcaagctct 3600

gcctcctggg ttgatgttga tgctaccggt aaagtgacat ttaaaaatgt cggcagcaaa 3660

tgggagagga ttacggcgac gccaaaaaca ggcggcccta gctatatata cgaaatccga 3720

gtgaagagtt ggtgggtgaa cgccggcgat gctttcatga tatacagcct tgctgaaaat 3780

ttttgcagta gcaatggcta cacacttccc cttggagacc atttaaacca tagtcgttcc 3840

cgaggcatcg ggtcactgta cagtgaatgg ggagatatgg ggcattacac gactgaagct 3900

ggttttcatt caaatatgta ttggtcatcg agtcccgcaa actcaaacga acaatacgta 3960

gtttccctgg caacaggtga tcaaagcgta tttgaaaagc ttgggtttgc ttatgcgaca 4020

tgttataaaa acctctga 4038

<212> Type : DNA

<211> Length : 4038

SequenceName : SEQ ID 661
SequenceDescription :



Sequence



<213> OrganismName : Escherichia coli 0157:H7
<400> PreSequenceString : 

atgagcctga ttattgatgt tatttcgcgt aaaacatccg tcaaacaaac gctgattaat 60

cctggcgacg tcacggttgt tatttatgag ccttccgtgg tgcaggttca tgctcaggcc 120

tctgccgttg cgcgttacgt ccgtgaagga aatgacctgc tgatctatat gcaggacggc 180

acggtgatcc gctgcaacgg ttatttcctg caagcggcga atacagctga acaatcggaa 240

ctggtgtttg ccgatggtca acagctaacc catatcacct ttgccgatac tgctgcgggt 300

ggattagccc ccgtagaact gactgcccag accactgcga ttgaaagcat tgcgccattt 360

cttgataccg ttgctcagac cagcgccttc ccgtggggtt ggctggcggg ggcggcggta 420

ggtggtggcg cgcttggtgc actgctggca agcggtggcg atggcgactc gaaaacagaa 480

gtgattaata accctacgcc acctgctgag cctggcaacg ccacaccatc atttttagtt 540

accgataatc agggcgatca gcgcggcatt ctagccacca atgacatcac cgatgacacc 600

acgccaacct ttagcggcag cgggcaggcg ggggcgacta ttcagattaa agacagtaac 660

ggcaatacta ttgccagtac tcaggtagac aacaacggtc actggagtgt ctcgttaccc 720

acgcaaagtg caggtgaaca tacctggtca gtggtgcaaa ttgtcggcag taccatcact 780

gacgccggtt cgataacgtt aaccatcgac aatagtcagg ccagcgtgca ggttgccacc 840
accgcaggcg ataacattat taacgccagc
accagtagcc atctggcgca gggaacagaa 
acgaccagcg taggcgctaa cggtcjcctgg 
gcgttaggcg aagggaatca ggcggtgctg 
gtcaccggcg cgcagctact aacgcjtcgat
atcgctcagg acaacattat cagtgjctgcg 
acgtcgaatg cagaagcggg gcaaaccgta 
gcaaccgtcg gtagcgacgg aacctggcaa 
ctggcggagg gtaattacgc tgtcaatgcc 

agccacagcg cgaatttcac ggtagacacc
gcgggcgacg atattcttaa taatcjccgag 
gtcagcggtg cttctccagg cgatacggta 
ggcatcgtgc tggcagatgg cagctggaat 
ctggatcgcg gagccaatac gattfcfccgtc 

gcggcgtctc gagcaatcac gctgcjtcggt
tccggcgatg acattatcag tggcgcagaa 
actcaacagg ctgagacagg acaaaccgtc 
actaccgtgc aggccgatgg ctcctggagt 
ctgcctgacg gcgcggtggc gattaccgct 

aacacttccc gcaccattac cgtcgatagc
accgctgata acatcattaa cgccgccgaa 
accgacgctc agccggggca gacgcgtgacc 
gtcgtgcagc cagacggcac ctggagcgtg 
gctgacggca acgctacggt caccgccagc 

gtttcacgcg tggcgctggt ggatcgccacg
accgataacg tcatcaacac gccggaacat 
actggcgctc aggcgggcga tatcgtcacc 
gtggtggatg gttccggcaa ctggagtctg 
gcggacggca gttatcctgt cagcgtctcg 

cagtcattga ccgtcacggt caataccgcc
ggcgatgatg tgattaacgc cagcgaaaaa 
gatcagcctg ttaacaccgc cafccaccgtg 
accgacgcct ccggcaactg gagcgtcacc 
caggccaact atacggtaac ggcgcjcggtg 

agccataacg tgctggtcga cagcgcgctg
gacgatafcta ttaacgccgc cgaacjcgggc 
ggcgcggaag atggcgacac ggtaactatt 
gtgggcagca atctcacctg gagccgtggac 
aatggcgatt taacggttaa tgcctcagtc 

acgcgggata tcaccatcga cgccraatctg
gatgatgtgg tcaatatcat cgagoacggg 
gggctggctg aaagcacgcc gcttaccgtt 
gtgcaggccg atggtagctg gagcgtgggc 
gcggggacgg ttaatattgc cgtttcaggg 

acgcatccgg tgacggtgga tctcactccg
gacgatgtga ttaacgccgc agaaaaaggc 
aacgtagaac ccggtcaaac cgtcaccgtc 
gtagcgagcg atggtagctg gactgccacc 
gagggcagcg cctccgcact ggccagcgtc 

gtgcacaact acagcgtcga cagcagcgcg
gacaatatcg tcaacgccag cgaagccgat 
gccgaagcgg ggcagattgt tacgataacg 
gcaacggtgc aggcggacgg cagctggagc 
ttgaccgatg gcagccacac cctgaccgcc 

agcaccacgc ataatctggc ggtgcgatctc
gcgggcgatg acattattaa cgccaccgaa 
agcaccggcg gagaagcggg ggatgtcgtc 
accaccctgg acgcctccgg caactrggagc 
cttggcagcg gcccgcaaac tgtcaccgcc 

aattag

<212> Type : DNA 
<211> Length : 4386 

SequenceName : SEQ ID 662 
SequenceDescription :




gaacaggccg ccgggtttac gctttctggc 900

ctcaccgtta cgctaaacgg caaaacctac 960 

agcgtgcagg tgccgaccgc cgatgcacag 1020 

gtcagtggga aagacgccac aggcaatacg 1080 

acccaaccgc caacgcttgc catcaacacc 1140 

gaacataacg tcgcgctggt actgagcggc 1200

acactgaccg tcaacgggaa aagccataca 1260

gtgacgctgc ctgccacgga agtccaggca 1320

agtgtcagcg atcgggcagg gaacaccacc 1380

tcagcacccg tggtcagtgt taataccgtg 1440

caggccgtcg cgcagatcat ctccggacaa 1500

acggtgaaat tgggcactca tgtcctgacg 1560

gtggcgctgg acccagcggt aacccgcacg 1620

accgtgacag atgctgcagg aaatactggc 1680

gtttctccgt tgatcaccat taacaccgtct 1740 

aaaggtgcgc cactgaccct taccggtagc 1800

acagtaaccc tggctggaca gagttttacc 1860

ctgacggtac ctgccgccgc gatgggaaat 1920

tctgtgacgg atctcagcgg caataccggc 1980

caggccccgg ccttaagcat tgatccactg 2040

agcgggcagg atctgcccat caccggcacc 2100 

gttacgttaa atgggcagac gtatcagggc 2160 

actgtgcccg ccgccaacgt gggcgcactg 2220

gtgaacgatg tcgccggtaa tccgagcagc 2280

ccgccggtgg taaccattaa tccggtggcg 2340

gctcaggcgc aaatcatcag cggcacggtt 2400

gtgacgctga ataatgtgga ttacaccacg 2460

ggcgttccgg cctcggtggt cagtgggctg 2520

gtaaccgaca aagccggaaa cacgggcagc 2580 

gcgcccctta tcggcattaa cagcattgcg 2640 

ggggccgatc tccagattac cggcaccagc 2700

acgctgaacg ggcaaaatta caccaccacg 2760 

gttccggcat cggcggttac agcattaggc 2820

accagcgata tcggcaacag cgccactgcc 2880 

cccggtgtga ccattaatcc ggtggcaacc 2940

gtggcgcaaa ccatcagcgg gcaggtgact 3000

acgttgggtg gtaatactta tacggcgacg 3060 

gttccagcgg cagatattca ggcgctggga 3120 

accaatcaaa acggcaacac cggcagcggc 3180 

cccggcctgc gggtcgatac ggtggcgggc 3240 

caggcgctgg tggtcaccgg cagcagctcg 3300

acgattaata atgtggaata caccactgcg 3360

gtcacggcgg cgcaggttag cgcctggcct 3420

gaaagtagcg ccggaaactc ggtgagcatt 3480

gcagcgatca ccatcaacac catcgccacg 3540

gctgatttaa ccctttccgg caccaccact 3600

acctttggcg ggaaaaatta cactgccagc 3660

gtacccgccg ccgatctggc gtcattaccc 3720

agcaatatca acggcaatag cgcctcggcg 3780

ccaaccatca ttatcaatac cgtcgccagc 3840

gcgggcgtga cggtgagcgg cagtaccacc 3900

cttaacagcc cgaccgtgca gacctatcag 3960

atcaatattc cggcggcaga tcttgaggca 4020

acggtcaatg acaaagcggg caatccggcg 4080

accgttccgg tgctgaccat caacaccatt 4140

cacgggcagg cgctggtgat ttccggttcc 4200

accgtcacgc taaacagtaa aacctacacc 4260

gtcggcgttc cggcggcgga tgtcacggcg 4320

acggttaccg atgcggcagg caacagcgac 4380

4386 



Sequence 
<213> OrganismName : Escherichia
<400> PreSequenceString :
atgaatagaa tctatcgcgt gjatatggaat 
gaattaactc gcagggtagg fcaaaacatca 
acaacgaaat tcagtagatt gacgctgggt
ggtgcgagtc tggaagttga fcaatggtcag 
gatgcctacc tggttggctg gtatggcact 
gcctccttaa ccactattac taccagcgtc 
gttaatgttt tgggtggcac ctggcgattg 
aatgtgggtc aatccggaac cjgggacgctg
ggctatttaa gattaggtac fccaagctgca 
gactcggttt tgacgaccga attattcgaa 
attacggata agggttacgt cacgagttca 
■ " agtaatggta aggttgtcgt ..fcgaaaagggt 
tcaattgaat ttcaaattgg taatcaagga
ctgattacgg ctgaaaatac cjattatcggt 
gtgcaggatc aagactctgt catcacggta 
ggcgcagtca atatttccaa taatggacta 
gttcaggacg gttcccacgg tgtcgtcaac 

20 ggaacgggcg aagctttccg ctatatctat 
gtctcgcgtg aaggtaaagt agattcggga 
acaggcaacc ttactgttaa ggataagaac 
ggttatgacg gccacggcga aatgaatatc 
ggaagctctc tcggttatgg agaaaccggc 

25 atatgggagg tcaataagaa tgtctatacc 
aatattagcg atggcggtaa cjttcgtatcg 
agcggtatcg gcacactgaa cctgatggat 
aatgtcggta attttggtag cggtatcgta 
acgggctatg gatttatcgg aggaaatgcc 

30 gatagtctct ggaatttaaa gacgtcttct 
ttaggcacgg gtgaactgaa tatfcaccacc 
atagctctca atgacaaaag fcaagggcgac 
gaaacattca atatgtacgfc agggacatct: 
ggcacgctga atgtcgaagg tggagaagtt 

35 acgctaaaca ttggfcgcagc fccacggtgag 
gcgacgaaag tagagtttgg ctctggcgaa 
agtgatgccg gctaccaggfc cgafcatgctg ^ 
atccatgatg caggccatac ggtgttcaat 
gtcaatgacg gcctcctgac crafctgcgtct 

40 tcgagtgaag taaccattgc aagccccggt 
gcaggagatt acacgctgac craatgcgctc 
tcatcctccg acaagatgtt tggctttaca 
gcccaactga aagacagtac cttcactctg 
gcgatgttgc agtctgacat tgaaaatacc 

45 ggtggactgg ccatgaatgg cggtacgctc 
cttgcagagg gatatatcag cgtcgatacg 
aaaggccgta actatcaggt aaacgggacg 
tggaatgatc ctatggcgaa taaccctctg 
aaccatgtcg gcgttcaacfc cjgtgaaggcg 

50 acgttacgtg atttacaggg ccjacgaggtg 
aacggaacgg tggtcgccga gggtgattat 
ggtttgtacg ttaactatgg gctgaaagcg 
ttagccgaac atggcggagc ctatggcgca 
gaaggggatc tggcaatcaa tacggtgcga 

55 tatcaggggg caacctacgt tcagatgggg 
ggcaacaccc gggaactgaa catcagcaac 
cagacggtag agacattcac cgggcagatg 
ctgacggtaa ataaaggtgg gatcagtcag 
gttacagggg gaacgctggc tgtcgagggg 

60 gttagcccaa atgcggaagt cagcctcgat 
gccaatgacg gtctgttaac gctaaaaaac 
gggaagggta tcgtgagcgc aaccgccagg 
cgctttgtgg ggcaattcaa cattgataca 
aacctgggtg atgcttccgt tatcaataat 

65 tgggcgatga cgcacagtat cagcggtagc 
ctgactctta acaacgattc ctcggcgtat 
attgctttcg gttccgactc tgccattaat 



coli 0157:H7

tgcactctac aggtatttca ggccfcgctcg 60 

acggttaatt tgcgaaagtc ctctggactg 12 0 

gttttgctgg cactaagcgg ttcagtgtct 180 

attaccaata ttgatacfcga tgttgcttat 24 0 

ggagtgctta atattttggc tggcggtaat 300 

attggcggta atgaggattc ggagggtacc 360 

tatgatagcg gaaataatgc aaggccttta 420 

aatattaaac agaagggtca cgtcgatgga 480 

ggcgtcggga cggttaatgt tgagggagag 540 

atagggagtt acggtacggg ctcattaaat 600 

atagtcgcca ttttaggata tcaagcgaac 660 

ggcgagtggc taataaaaaa taatgattcc 720.. 

actggggagg cgactattcg cgagggtggg 780 

ggcaatgcca ccggtgtcgg aaccctgaat 840 

cgcagactct ataatggata tttcggtaat 900 

attaataaca aagaatatfcc attggtgggc 960 

gtgaccgata aagggcattg gaatttcctc 1020 

atcggtgatg ctggcgacgg tgaacttaat 1080 

attatcactg cggggatgaa agaaacaggc 1140 

tccgttatca ctaatctcgg aactaatctt 1200 

agtaatgagg ggcttgttgt cagcaacgga 1260 

gtcgggaagg ttagcatcac cacggggggg 132 0 

accattggtg ttgcgggcgt cggaaacctc 13 80 

caaaatatta cttttttggg cgataaagca 1440 

gcgacatcgt cgttcgatac tgtgggtatc 1500 

aatgtcagta atggtgccac ccttaattca 1560 

tccggtaagg gaatagttaa tatttcaacg 162 0 

actaacgccc aattgctaca ggtcggtgfca 168 0 

g9 a ggtatag ttaaagcgcg tgatacacag 174 0 

gtgagggtgg atgggcagaa ctctcttctt 180 0 

ggtacgggta cgttaaccct gacgaatagc 1860 

tacttaggtg tttttgaacc tgctgtagga 1920 

gcggcggcag atgccggatt tatcaccaac 1980 

ggtgtttttg tctttaatca tactaataac 2040 

afctacaggtg acgataaaga cggaaaagtg 2100 

gcagggaata cttatagcgg taaaacgctg 2160 

catacggcag atggggtaac gggcatgggg 222 0 

acgctcgaca ttctcgcatc aacgaacagt 2280 

aaaggcgatg gcttgatgcg agttcagctg 2340 

catgcaacag ggactgaatt cgccggtgtt 2400 

gaacgcgaca acaccgctgc gcttactcac 2460 

acatcggtaa acgtgggaga gcaatccatt 2520 

attttcgata cggatattcc tgctgcgacg 2580 

ctggttgtcg gcgcgagtga ctacacctgg 2640 

ggcgacgtgc tfcatcggcgt gcctaaaccg 2700 

acgacgctca atttgctgga acacgatgat 2760 

caaacggtta ttgggtcggg tggctcatta 2820 

gaagcggaca aaacgttaca cattgcgcaa 288 0 

ggattccgcc tcacgaccgc accaggtgat 2940 

ctgaacatcc atggagggca aaagctgacg 3 00 0 

acggccgata tgtcggcaaa aatcggtggt 3060 

caggtttcgc tttccaacgg tcagaacgac 312 0 

acattacgta ccgatgcgga tggcgcgctg 3180 

gcagccatcg tcgatcttaa tggatcgacg 3240 

ggttcgactg ttttgttcaa agaagggtcg 33 00 

ggtgaactga caggtggcgg aaacctgaat 33 60 

cttaatgcac gctacaatgc gttaaccagc 342 0 

aatacgcagg ggttaggcag aggaaatatt 34 80 

gtgactggcg aactgcgtaa tagcataagc 3540 

acagatgtag agctggatgg cgataatagc 3600 

ggcagcgcgc tcagcgtcaa cgagcagaaa 3660 

ggcctgctca ccatctccac tgagcgtagc 3720 

ggtgatttga caaaactggg taccgggatc 3780 

cagggtacga cggatatcgt ggggggggaa 3840 

acggcaagtc aacacattaa tatccataac 3900 
agcggtgtga tgtcgggaaa tgtcaccact gcaggtgatg tgaacgttat gtctgggggg 3960

acactgcgtg tcgctaaaac cacaatcggc gaatcggcgg caacctggag aatggcggca 4020 

cggttcaaat ga 4032 
<212> Type : DNA 
<211> Length : 4032

SequenceName : SEQ ID 663 

SequenceDescription : 

Sequence 


<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

atgggcatca aacaacacaa tgggaatacc aaagccgatc gtctcgctga attaaaaatc 60 

c-^cgtt.qgccct caattcaact, gataaaattt ggcgctattg.gtttgaafcgc aattctcttt 12 0. 

15 tcccccctgc tgatagctgc tgatacagga agtcaatatg gcaccaatat tactattaat )180 

gatggtgaca gaattactgg agataccgcc gatccatcag gaaacctcta tggtgtaatg 240 

accccagcag gaaacacgcc tggcaatatc aacctgggta atgatgtcac cgtcaatgtc 3 00 

aacgacgcct ctggatatgc aaaaggaatc attattcagg gcaaaaacag ctccctgaca 3 60 

gctaaccgac tcacagtaga tgttgttggt caaacctctg ccatcggcat taatttaatfc 420 

20 ggtgactata cccatgctga cfctaggcaca ggcagcacca ttaagagtaa cgatgacggc 480 

atcattattg ggcatagctc aacactaaca gccactcaat tcaccattga aaactcgaac 540 

ggtataggcc taaccatcaa tgactatggc accagtgtcg atcttggaag cggaagtaaa 600 

atcaagaccg atggaagtac aggtgtttat atcggtggtc tcaacggcaa taacgccaat 660 

ggtgctgcgc gttttacggc gacagacctg acaatcgatg ttcagggcta cagcgccatg 72 0 

25 gggataaacg tacagaaaaa ctctgttgtc gatctcggaa caaacagttc cattaaaacc 780 

agtggcgata atgcacacgg cctctggagc tttggccagg ttagcgcgaa tgcactcact 840 

gttgatgtaa ctggagccgc ggccaatggc gtcgaagttc gtggtggtac aaccactatc 9 00 

ggtgcagata gccatatttc ttccgcgcag ggcggtggtc tcgtcaccag tggttcagac 960 

gcgacaatca atttttctgg cacggcagcg caacgaaaca gcatcttttc cggcggttct 1020 

30 tatggtgcct cggcccagac ggcaacggct gttatcaaca tgcaaaatac cgatattacg 1080 

gttgatcgta atggcagtct ggcgctgggt ttgtgggcgc tcagcggcgg tagaataacc 1140 

ggagacagtt tggctatcac cggcgcggca ggagccaggg ggatttatgc catgaccaac 1200 

agccagatrcg acctcacgag cgatctggtc attgafcatga gtacacccga ccagatggcc 1260 

atcgcaacgc aacatgacga tggttatgcc gccagccgca tcaacgcctc gggtcgtatg 1320 

35 cttatcaacg gtagcgttct tfcccaaaggt gggctaatca atctggatat gcaccctggg 1380 

tcggtttgga caggttcctc cctcagcgat aatgtcaatg gcgggaaact ggacgttgca 1440 

atgaataaca gcgtctggaa cgtaacaagt aattctaatc tcgacacgct. ggcgctgagc 1500 

cattcaactg tcgattttgc cagccacggg tcaactgccg gcacatttac cacattaaac 1560 

gtagagaacc tgagcggtaa eagtaccttt attatgcgtg ctgatgttgfc tggcgagggt 1620 

40 aatggcgtta agccctgggc ctga 1644 
<212> Type : DNA 
<211> Length : 1644 

SequenceName : SEQ ID 664 
SequenceDescription : 


Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

50 gtggggatcg acagccgtaa tcfatattcct gaggggattg cgacgctggg cgcttttatg 60 

ggttattccc attcacatat cggttttgat cgtggaggac atggcagtgt ggacagttat 120 

tctctgggcg gctatgccag ttgggaacat gaaagtggtt tctatctgga cggtgtcgtg 180 

aagctgaacc gttttgaaag taacgtagcc ggtaaaatga gcagcggtgg agccgccaat 240 

ggcagttacc atagcaacgg gctgggcggt cacattgaaa ccgggatgcg atttaccgat 3 00 

55 ggtaactgga acctgacgcc gtatgcctcg ttaacggggt tcaccgctga taaccccgaa 3 60 

tatcatttat ccaatggcat ggaatcgaaa tcagtcgata cccgcagtat atatcgtgaa 420 

ct gggtg c aa cgctgagtta caacatgcgt ctggggaacg gtatggaagt tgagccgtgg 480 

ctgaaggcgg ctgtgcgcaa agaatttgtc gatgataacc gggtgaaagt gaatagtgac 540 

ggtaatttcg tcaatgattt gtcgggcaga cgtggaatat accaggcagg tattaaagcc 600 

60 tcattcagca gtacgttaag cgggcatctt ggggtggggt atagcaacgg tgctggtatg 660 

gaatccccgt ggaacgcggt ggctggtgtg aactggtcgt tctga 705 
<212> Type : DNA 
<211> Length : 705 

SequenceName : SEQ ID 665 



SequenceDescription :



Sequence 






<213> OrganismName : Escherichia
<400> PreSequenceString : 
atgaaaaaaa aggttctggc aatagctctg 
5 caggctgctg acgtaacagc tcaggctgta 
accaccagta agctggttgt gacgccactc 
attaaaggtt ttaactcaca gaaaggtcta 
gctaccgcct ttaaactgac ctcacgtctt 
tcaggttcca cactgaatgt gggcgtggat 
10 gataccgtga tgatcgatac cgccaacggc 
aacggttaca atgccagcaa tcgtaccacc 
agcggcacca ccaatggtac caccgcagta 
tggagcggcg acgttagcgt acagttcgac 
<212> Type : DNA
<211> Length : 588

SequenceName : SEQ ID 666 
SequenceDescription :

Sequence 


<213> OrganismName : Escherichia 
<400> PreSequenceString : 
atgacggcag aatcctacga tgataactac 
accgggcagg ggcagaaatc tgcaggtgat 

25 gaggaaggcc agaaagggct tataggctgg 
atccgttttc cgaatggcac ggtggatgtg 
gccgtgacgg cgaaagaagt gatcacccgc 
tctgtagcgg aagaacgcag caaaattacg 
tccggtacgg tggcaaaagg gaaaacaacc 

30 gcaacagaca agacgttcag agcggtttcc 
aaagatatga caattacggt aaacggcgtg 
agcggaaatg gtcagttcgc cgcagtggct 
gggtaa 

<212> Type : DNA 
<211> Length : 606

SequenceName : SEQ ID 667
SequenceDescription : 

Sequence 

<213> OrganismName : Escherichia 
<400> PreSequenceString : 
atgacggcag aatcctacga tgataactac 
accgggcagg ggcagaaatc tgcaggtgat 

45 gaggaaggcc agaaagggct tataggctgg 
atccgttttc cgaatggcac ggtggatgtg 
gccgtgacgg cgaaagaagt gatcacccgc 
tctgtagcgg aagaacgcag caaaattacg 
tccggtacgg tggcaaaagg gaaaacaacc 

50 gcaacagaca agacgttcag agcggtttcc 
aaagatatga caattacggt aaacggcgtg 
agcggaaatg gtcagttcgc cgcagtggct 

gggtaa 

<212> Type : DNA 
<211> Length : 606

SequenceName : SEQ ID 668 
SequenceDescription : 

Sequence 


<213> OrganismName : Escherichia
<400> PreSequenceString : 
atgttatata atataccttg tcgaatttat 
gggatagttt ctactgcaac cgcaacttct 
65 ctcgtcgtga ccacgaatcg ttcggcaagc 
gttattgacc aacaaacatt gcagaactcc 
gacatccccg gagtagagat aacagacaac 



coli 0157:H7



gtaacgg-tgt ttaccggcat gggtgtggcg 60 

gcgacctggt cagcaacagc caaaaaagac 120 

ggtagcctgg cgttccagta tgccgaaggc 180 

tttgacg-tgg ctatcgaggg tgactcaacg 240 

atcaccaaca cattaaccca gttggatacc 3 00 

tataacgcgcg cggcagtcga aaaaactggc 3 60 

gtactgg-cjcg gcaaccttag cccgctggca 420 

gcacagga.tg gtttcacttt ctccatcatc 480 

accgattaca gcactctacc ggaaggcatc 540 

gcgacctgga ccagttaa 588 



coli 0157:H7 



ctggatgatg aagacgcgga ctggaccgcg 60 

accagtttta cgctggcctg gaaaccggga 120 

tttgaaacjcg gcgatgtccg ggcctataaa 180 

tttcgtgcjct gggtcagcag tatcggtaag 240 

acggtgaa.ag tcactaacgt gggtaaacct 3 00 

ccggtcagtg cgattaaggt gacgccgaca 3 60 

accctgacgg tttcttttga gccggaaagt 420 

gccgatccgt cgaaagccac cattagtgtg 4 80 

gcgacaggta aggtgcagat ccctgtggtg 540 

gaagtcaccg ttactgaagc gggcgctgca 600 

606 



coli 0157:H7 



ctggatgatg aagacgcgga ctggaccgcg 60 

accagtttta cgctggcctg gaaaccggga 120 

tttgaaacgcg gcgatgtccg ggcctataaa 180 

tttcgtggct gggtcagcag tatcggtaag 240 

acggtgaaag tcactaacgt gggtaaacct 3 00 

ccggtcacgtg cgattaaggt gacgccgaca 3 60 

accctgacgg tttcttttga gccggaaagt 42 0 

gccgatccgt cgaaagccac cattagtgtg 48 0 

gcgacaggta aggtgcagat ccctgtggtg 540 

gaagtcaccg ttactgaagc gggcgctgca 60 0 

606 



coli 0157:H7



atcctttcca ctctgtcatt atgcatttct 60 

tcagaaacaa aaatcagcaa cgaagagacg 12 0 

aacctttcggg aaagcccggc gactatacag 180 

accaatgcct ccatagccga taatttgcag 24 0 

tccttggcag gccgtaaaca aatccgcatt 3 00 
cgtggcgaag catcctcccg tgttttaatt ctcattgatg gtcaggaggt aacttatcag 360

cgcgccggag ataattatgg tgtgggactg fctgatagatg agtctgcgct ggagcgtgtt 420 

gaggtagtga aaggtccata ttccgtactg tacggttcac aggcaattgg cggtattgtt 48 0 

aacttcatca ccaaaaaggg aggtgacaaa cttgcatctg gagttgtgaa agctgtttat 540 
5 aattccgcaa cagcaggctg ggaagaatca atcgcggtcc aggggagcat cggtggattt 60 0 

gattatcgca tcaacggtag ttattctgat cagggcaatc gtgatacgcc ggatggacgt 660 
ctgccgaata ccaactatcg taacaatagt cagggtgtat ggttgggtta taactccgga 720 

aaccatcgtt ttggcctctc gcttgatcgc tacagactcg cgacacaaac ttactatgag 780 
gatccagacg gaagttatga ggcatttagt gtcaaaatac ctaaacttga acgagagaaa 84 0 

10 gttggggtat tctatgacac agacgtggac ggtgactatc taaaaaaaat tcatttcgac 900 

gcgtatgagc agaccatcca gcgccaattt gccaacgaag taaaaacgac acagcctgtt 960 

cccagtccga tgattcaggc tctgaccgtt cataacaaga ctgacaccca tgataagcaa 1020 

tacactcagg cggtcacatt gcagagtcac ttttcgctgc ctgctaataa tgaacttgtt 10 80 

accggtgcac agtacaaaca agatagggtc agccaaaggt ccggfcggcat gacctcaagc 1140 

15 aaatctctga ccggcttcat taataaggaa acacgaactc gctcctatta tgagtcagag 12 00 

caaagtacag tctcactatt cgcacaaaat gactggcaat tcgccgatca ctggacatgg 1260 

acaatgggag ttcgccaata ctggctttct tcaaagttga cgcgtggtga cggagtatca 1320 

tataccgcag gcattataag cgatacctct cttgccagag agtctgcgag tgatcacgaa 13 80 

atggtaacat ctacaagcct gcgctattca ggtttcgata acctggagtt acgcgctgcg 1440 

20 ttcgcgcaag gctacgtatt tcccacactc tcccagcttt ttatgcagac atctgcgggc 1500 

ggcagtgtca catacggaaa tcctgatctt aaggctgaac actccaataa ctttgaatta 1560 

ggtgcacgat ataatggtaa tcagtggctg attgacagcg cagtttacta ctcagaagct 1620 

aaagattata ttgcaagtct gatctgtgat ggcagtatag tttgcaatgg taacaccaac 168 0 

tcctcccgta gtagctacta ttattatgac aatattgatc gggcaaaaac atggggactg 1740 

25 gaaataagcg cggaatataa tggctgggtt fctctcgccat atatcagtgg caatttaatt 180 0 

cgtcggcaat atgaaacttc aacattaaaa acaactaata caggtgaacc agcgataaac 1860 

ggacgtatag ggctgaaaca tactcttgtg atgggtcagg ccaacataat ctctgatgtt 192 0 

tttattcgtg ctgcctctag tgcaaaagat gacagtaacg gtaccgaaac aaatgttccg 198 0 

ggctgggcca ctctcaactt tgcagtaaat acagaattcg gtaacgagga tcagtaccgg 2040 

30 attaacctgg cactcaataa cctgacagac aaacgctacc gtacagcaca tgaaactatt 2100 

cctgcagcag gttttaatgc agctataggt tttgtatgga atttctga 2148 
<212> Type : DNA 
<211> Length : 2148

SequenceName : SEQ ID 669 



SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli 0157 :H7 

<400> PreSequenceString :

atgacaaaaa tgagtcgcta cgccttgatt accgcgctgg cgatgtttct cgccgggtgt 60 

gtggggcaac gtgaacctgc accggtagaa gaagtgaaac cagcgccgga acaaccagcc 12 0 

gagccacaac agcctgtccc cacagtgccc tcggtgccga cgatcccgca gcagccaggc 18 0 

ccaattgagc acgaagatca aactgcaccg cctgcgccgc atattcgcca ttatgactgg 240 

45 aatggcgcaa tgcagccgat ggtcagtaag afcgcttgggg ctgacggggt gactgcgggt 3 00 

agcgtcctgc tggttgatag cgttaacaac cgtactaacg gttcgctgaa tgccgcagaa 3 60 

gcgaccgaaa cgctgcgaaa tgcgctggct aataacggga aatttaccct ggtttccgcc 420 

cagcagctgt cgatggctaa gcaacagtta ggtttgtcgc cgcaggatag tttaggcacc 480 

cgtagtaaag ccataggcat tgcccgcaat gfccggcgctc attacgtgct gtactccagc 540 

50 gcctctggca acgttaacgc tccgacccta caaatgcagc tgatgctggt gcagacgggc 600 

gaaattatct ggtcaggtaa aggtgccgtt tcgcagcaat aa 642 
<212> Type : DNA 
<211> Length : 642 

SequenceName : SEQ ID 670 



SequenceDescription :

Sequence 



<213> OrganismName : Escherichia coli 0157 :H7 

<400> PreSequenceString :

atgaaaagca aagtactggc acttttaatt cctgccctgc tcggcgcagg tgctgcacat 60 

gcagccgaag tttataataa agacggcaac aaattagatc tgtatggcaa agttgatggc 120 

ctgcattatt tttctgataa ttcagcgaaa gatggcgacc agagctatgc gcgtctgggt 180 

tttaaaggcg aaacccaaat taacgatcaa cttactggct acggtcaatg ggaatacaat 240 

65 attcaggcaa acaacactga atcttcaaaa aaccagtcat ggacccgtct ggcatttgcc 3 00 

ggactgaaat tttcagatta cggttctttc gattacggac gtaattatgg gctggaccga 360 

tatgctgcct ga 372 






<212> Type : DNA 

<211> Length : 372 

SequenceName : SEQ ID 671 
SequenceDescription :


Sequence 



<213> OrganismName : Escherichia coli 0157 :H7 
<400> PreSequenceString : 

10 atggcaacac caaatcccct tgagccggta aaaggtgccg gtaccactct gtgggtttac 60 

aacggcaagg gtgatgctta tgcaaacccg ttgtcagacg atgactggca gcgactggct 12 0 

aaggtgaagg atctgacgcc gggcgagatg acggcagaac cctacgatga taactacctg 180 

gatgatgaag acgcggactg gaccgcgacc gggcaggggc agaagtctgc aggagatacc 24 0 

agttt-tacgc tggcctggaa accgggagaa gaaggtcaga- aagggcfctat aggctggttt 300 

15 gaaagcgggg atgtgcgggc ctataaaatc cgtttcccaa atggca_cggt ggatgtgttc 360 

cgtggctggg tcagcagtat cggtaaggcc gtgacggcga aagaagtgat cacccgcacg 42 0 

gtgaaagtca ctaacgtggg caaaccttcc gtggcggaag aacgcagcga aattacgccg 48 0 

gccactgcaa ttaaggtgac accgacatcc ggtacggtgg caaaagggaa aacaaccacc 54 0 

ctgactgttt cttttgagcc ggaaagtgca accgacaaga cgttca.cjagc ggtttccgcc 600 

20 gatccgtcga aagccaccat tagtgtgaaa gatatgacaa ttacgg-taaa cggcgtggcg 660 

acaggtaagg tgcagatccc tgtggtgagc ggaaatggtc agttccjccgc agtggctgaa 72 0 

gtcaccgtta ctgaagcggg cgctgcaggg taa 753 
<212> Type : DNA 
<211> Length : 753 

SequenceName : SEQ ID 672

SequenceDescription : 



Sequence 



<213> OrganismName : Escherichia coli 0157:H7
<400> PreSequenceString :

atggcaacac caaatcccct tgagccggta aaaggtgccg gtaccactct gtgggtttac 

aacggcaagg gtgatgctta tgcaaacccg ttgtcagacg atgactggca gcgactggct 

aaggtgaagg atctgacgcc gggcgagatg acggcagaac cctaccjatga taactacctg - 

35 gatgatgaag acgcggactg gaccgcgacc gggcaggggc agaagtctgc aggagatacc 

agttttacgc tggcctggaa accgggagaa gaaggtcaga aagggcttat aggctggttt 

gaaagcgggg atgtgcgggc ctataaaatc cgtttcccaa atggcacggt ggatgtgttc 

cgtggctggg tcagcagtat cggtaaggcc gtgacggcga aagaagjtgat cacccgcacg 

gtgaaagtca ctaacgtggg caaaccttcc gtggcggaag aacgcagcga aattacgccg 

40 gccactgcaa ttaaggtgac accgacatcc ggtacggtgg caaaagjggaa aacaaccacc 

ctgacggttt cttttgagcc ggaaagtgca accgacaaga cgttcagagc ggtttccgcc 

gatccgtcga aagccaccat tagtgtgaaa gatatgacaa ttacgcjtaaa cggcgtggcg 

acaggtaagg tgcagatccc tgtggtgagc ggaaatggtc agttcg-ccgc agtggctgaa 
gtcaccgtta ctgaagcggg cgctgcaggg taa 

<212> Type : DNA

<211> Length : 753 

SequenceName : SEQ ID 673 
SequenceDescription : 

Sequence



<213> OrganismName : Escherichia coli 0157 :H7 
<400> PreSequenceString : 

atgggctgga ccgatatgct gcctgaattt ggcggtgact cttataccaa tgcagacaac 60 

55 tttatgactg gtcgtgccaa tggcgtcgcg acttatcgta atactgjattt cttcggtctg 120 

gtaaatggtc tgaacttcgc ggtgcagtat caaggtaaca acgaacjgtgc cagtaatggt 180 

caggaaggca ccaacaacgg acgtgatgtt cgccatgaaa acggtgjacgg ctggggtctt 240 

tccacaacat atgatttagg catgggcttt agcgctggtg cggcatacac ctcttctgac 3 00 

cgcaccaatg accaggttaa ccatactgcg gcgggtggtg ataaacjcaga cgcgtggact 3 60 

60 gctgggctaa aatacgatgc taacaatatt tacctggcaa ccatgtiattc agaaacgcgt 420 

aatatgaccc cgtttggcga cagcgattat gctgttgcaa acaaaaccca gaactttgaa 480 

gtcactgcac agtaccagtt tgattttggt ctgcgtccgg cagtctcttt cctgatgtct 54 0 

aaaggccgtg acctgcacgc tgccggtggt gcagacaacc cggcagjcjtgt tgatgataaa 600 

gatctggtta aatacgccga tgttggcgcg acttactatt tcaataaaaa catgtccact 660 

65 tatgttgact ataaaatcaa cctgttggat gaagatgaca gtttctacgc tgccaacggc 720 

atctctaccg atgatattgt cgctttaggt ctggtttatc agttctaa 768 
<212> Type : DNA 



60 
120 
180 
240 
300 
360 

4r2 0 

480 
540 
600 
660 
720 
753 
<211> Length : 768

SequenceName : SEQ ID 674 
SequenceDescription : 

Sequence 

<213> OrganismName : Haemophilus influenzae Rd 
<400> PreSequenceString : 

atggggttta ttatgaaact aactaaaaca gcattatgta ctgcactttt tgccaccttt 60 

actttttcag cgaacgcaca aacttatcct gatttgccag tggggatcaa aggaggaaca 12 0 

ggcgcattaa ttggcgacac agtttatgtg ggattaggtt ctggtggcga taaattctat 18 0 

accttagacc taaaagatcc ttcagcacaa tggaaagaaa ttgctacatt tccgggtggc 240 

gaacgcaatc aacctgttgc agccgcagtg gatggaaaac tttatgtatt cggtggttta 3 00 

caaaaaaatg aaaaaggcga acttcaactc gttaatgacg cttatcgcta taacecaagc, 3 60 

gataatactt ggatgaaact tcctactcgt tctccccgtg gtttagtagg atcaagtggt 42 0 

gcatctcacg gagataaagt ttatatttta ggcggctcta atctctctat atttaatggc 48 0 

ttcttccaag acactgtggc agcaggtgaa gataaagcga aaaaagatga aatcgcagca 54 0 

gcttatttcg atcaacgtcc agaagattat ttctttacaa cagaattatt gagctatgaa 600 

ccatcaacca ataaatggcg caatgaaggt cgtattccat tctctggtcg tgctggtgca 660 

gcctttacaa ttcaaggtaa tgagctagtg gttgtcaatg gcgaaattaa acctggactt 72 0 

cgcaccgctg aaacccatca aggtaaattc actgctaaag gtgtgcaatg gaaaaactta 78 0 

cctgacttac ctgctccaaa aggcaaatca caagatggtt tagctggtgc gctttcaggc 840 

tatagtaacg gtcattattt agtcactggc ggcgcaaatt ttccaggttc aatcaaacaa 90 0 

ttcaaagaag gaaaacttca cgcacataaa ggtttaagca aagcttggca taacgaagtt 9 60 

tatacgttga ataatggtaa atggcgcatt gttggggaat taccaatgaa tattggttat 102 0 

ggtttttctg tatcttacaa caataaagtt ttactgattg gcggtgaaac tgatggaggt 10 8 0 

aaggctttaa ctagcgtcaa agcaataagc tatgacggta aaaaattaac catcgaataa 114 0 

<212> Type : DNA 
<211> Length : 1140 

SequenceName : SEQ ID 675 

SequenceDescription : 


Sequence 

<213> OrganismName : Haemophilus influenzae Rd 
<400> PreSequenceString : 

atgggagaac aatatatgct tacgacgata ctcagcttcc tcattgtaac cactgtggtt 60 
gcgtatgttt cttggttaaa aacaaaaggg gatgatttaa aatcttcaaa agggtatttt 12 0 

ttggcagggc gtgggttaag cggtttagta attggctgct caatggtgct gacatcgctt 18 0 

tcaactgaac aacttatcgg tgtcaatgcg gtgtcctata aaggtaattt ttccgtgatt 240 
gcttggacag ttccaacggt tattccgctt tgtttcttgg ctctttatat aattggctgg 3 00 

ctataa 

<212> Type : DNA
<211> Length : 306 

SequenceName : SEQ ID 676 

SequenceDescription : 

Sequence 



306 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString :

atgaaaaatc aacacaaaaa tcccctaaca aaagctttaa tgaaaactta tccatataac 60 

cattttttat ttttctgctt tattctagga gcgtttttat taggtttgct cagtccagct 12 0 

tatgctttaa gtattatcac cactaaagaa attgacgcta atttgcttaa tggagcgata 180 

gaaagcaggg tggtgttagg caagagggtg tttaaagtag aagctcatgg gttttatttt 240 

agaaacaatg cgactaacag catagatata gaaatcacca gtcttttaag agacaatcaa 3 00 

tcgtttcctt tgactagcag tgctaaaacc agtttaaaaa tacctcctaa cgccaagatt 3 60 

aaaaaatcca ctatccttgt tttgaaaggc gagaacgctg aagaagtggc taagatttta 42 0 

ggcgttagca aagaagaata ccaaaagcta gaaaacatcg ctcaaaccaa agcggctaat 480 

gaccctatgt atgctaacac gccttttagt aatggttctg atagttcctt ttacgataac 540 

aatcctaata gccctagcaa taacgctatc aatggcaaag atggcgcaaa tgggagtaac 60 0 

ggctatgggg caaatggcaa tgatggggta aatgggatca gtgggagtaa tggtgcaaat 660 

999 a 9tcatt caaataataa tgcaataggc agtggtattg atacagatgg cgtgttaggg 720 

gtggatgggg tgaatggctc tagttcttca agtggcggct ctgtaggggg ttatgagaat 780 

aatttcacta atcatggctc tactaacaat aacacaggag ggtatgacaa ttttaataat 840 

ggcagctcaa gtggtgggag tttagggaat ggggggcttt tccctattcc ttttggtaat 900 
ggagacacaa acaattccaa taattccact aacaccacta gcccaactaa tggcagtagt 960

tctaataacg ccactaatcc tagttcgcaa gaaaacaatt actccagcca gtattgtaaa 102 0 

gtgccagagt taagccccaa caacacgatg aaactagatg ttatcgctaa agetfcggctct 108 0 

tgtatttcta tgaacgcttt aagagatgac actaaatgcg cttatagata cgattttgaa 1140 

gccggtaaag ccatcaagca aacgcaatac tactatgtag atagggaaaa taaaacgcaa 12 0 0 

aatatcggtg gttgtgtgga tttacaaggc gctcaatacg ccatgcaact ttacaaagat 12 6 0 

gacagcaaat gcgccttaca aaccacgagc gataaaggtt atggtatggg gaaaacgcaa 132 0 

acctttcaaa ctgaaatcgt gtttcgtggg atggacaatt taatccatgt cgctgtgcct 13 8 0 

tgcagcgatt atgcaagggt gcaagacagg attgttaggt atgaaaaaaa tgataaaacc 1440 

caaaccttaa cgcctatagt ggatcagtat tataatgatc ctaacaaccc taacaagcaa 150 0 

gagattttaa atcgtgggat tgccacccaa ttaagctcgc aatatcaaga atttgcatgc 1560 

ggtcaatggg aatacaatga cgctaaatta gaagccaaaa gacctacaat gctaaaaagc 162 0 

tataacaagc ttaatggaga atgggtagaa gttacgccct gtaattttga agcragggatt 168 0 

. aaaagcggtg cggttgttag cccttatgtg„ atgggcgtgc ctagttctaa agtcttaagc 1740 

gatattacta caagccatta ttttaggata gaaaggaaaa attatggtga gagagaacaa 180 0 

tgccaaaaac tttatggagt caatcgttgc caaccgcaat attccatact gatcctagta 18 60 

tcaccgattg gagcgccact tacaaaacca ctaccaccca aaccactcaa ccttatttac 1920 

gcccagccca agataatgaa aaacacccca caacctataa tcttatcacc actcaaacca 198 0 

ccatcaacag gactcaaagc gttttga 2007
<212> Type : DNA 
<211> Length : 2007 

SequenceName : SEQ ID 677 

SequenceDescription : 



Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgcctgtta taagagtttt agtaatgctt gcaacaatga tgatgaaatt agtaaaaacg 60 

gcaaaagaaa agaaagtttt taagaatgtg ggaatatcta taatggggat tgotttttgg 120 

gaagcgataa aagactcgat aaaaaaacaa attaaaaaaa gcgattggat atcjcgggaat 180 

gttaagactg cggatgatta tttaaaaacg catcctaact catggtttaa ttcagcaata 240 

ggtgtaacag cgataacagc catgcttatg aatgtgtgtt ttgctgatga ccaatccaaa 3 00 

aaagaagtgg ctcaagctca aaaggaagct gaaaacgcta gggatagagc gaacaagagt 360 

gggatagaac tggaacaaga agagcaaaag acagaacaag aaaaacaaaa gacagaacaa 420 

gaaaaacaaa agacagaaca agaaaaacaa aagacagaac aagaaaaaca aaagacagaa 480 

caagaaaaac aaaagacaag caatatagag actaacaatc aaataaaagt agaacaagaa 540 

caacaaaaga cagaacagga aaaacaaaag acaaacaata cgcaaaaaga ttfcggttaac 60 0 

aaagcagaac aaaattgcca agaaaatcat aatcaattct ttattaaaaa atfcaggaatt 660 

aaggctggca ttgctataga aatagaagct gaatgcaaaa cccctaaacc cacaaaaacc 72 0 

aatcaaaccc ctatccagcc aaaacacctc ccaaactcca aacaacccca ttctcaaaga 780 

ggatcaaaag cgcaagagct tatcgcttat ttgcaaaaag agctagaatc tctgccctat 840 

tcacaaaaag ctatcgctaa acaagtggat ttttataggc caagttctat cgcttattta 900 

gaactagatc ctagagattt taacgctaca gaagaatggc aaaaagaaaa tttaaaaata 960 

cgctctaaag ctcaagctaa aatgcttgaa atgaggagtt taaaaccaga cccacaagcc 102 0 

cacctttcaa cctctcaaag ccttttgctc gttcaaaaaa tatttgctga tgttagtaaa 108 0 

gaaataaaag tagttgctaa taccgagaaa aaagtagaaa aagcgggtta tggttatagt 114 0 

aaaaggatgt ag 1152 
<212> Type : DNA 
<211> Length : 1152 

SequenceName : SEQ ID 678 

SequenceDescription : 



Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaactacc ctaatctacc taacagcgct ttagagataa gcgaacagcc agaagtgaaa 60 

gaaatcacta acgagctttt aaagcaatta caaaacgctt taaggagcaa cgcgcatttt 120 

agcgagcaag tggaattaag ccttaaatgc atcgttagga ttttagaagt gcfctttgagt 18 0 

ttggattttt ttaagaatgc gaatgagatt gatagcagtt taagaaattc cafctgagtgg 240 

ctgactaacg ccggcgagag cttgaaatta aaaatgaaag aatacgagcg cttttttagc 300 

gagtttaata cgagcatgca tgccaacgag caggaagtaa ccaatacctt aaacgctaac 3 60 

gccgagaaca ttaaaagcga aattaaaaag ctagaaaatc aattgataga aaccacgaca 42 0 

agacttttaa cgagctatca aatcttttta aaccaagcca gagataacgc taacaaccaa 480 

atcacaaaaa acaaaaccca aagccttgaa gcgattacac aagctaaaaa caacgctaat 540 

aatgaaataa gcaacaatca aacgcaagcg ataactaata tcaccgaagc gaaaacgaac 60 0 
gctaataatg aaataagcaa caatcaaacg caagcgataa ctaacattaa cgaagccaaa 660

gaaagcgcta caacgcaaat aaacgccaat aagcaagaag caataaataa catcacgcaa 720 

gaaaaaaccc aagccacaag cgagatcacc gaagcgaaaa agaccgatca ttatcaaaac 78 0 

attgattttt ttgagtttga ataa 804 
<212> Type : DNA
<211> Length : 804 

SequenceName : SEQ ID 679 

SequenceDescription : 

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99
<400> PreSequenceString : 

gtgaaatttt..tttcaaagga tttgtttaaa aaagtaactc ctttatttt.t ...aagcgttt at - 50 
tttttaagcc ccacccttac gcaagccaaa agccgttttt atgtggcttc tcaataccag 12 0 

gtggggaaaa tgatcatgaa aaaatacaac gatctcaaac gcacgattga aggggcgagc 18 0 

ttttctttag gctgggagat taaccccact aattactggt tttattcgcg ctattacfctt 240 
tttatggatt acgggaatgt cattctcaat aaaagaacgg gcgctcaagc gaacatgttc 30 0 

acttacggct ttggagggga tttgatcatg gaatacaata aaaacccttt gtatgtatfct 3 60 

tctctttttt atggcatgca agttgctgaa aacacatgga cgatttccaa acacagcgcg 42 0 

aatttcatca ttgacgactg gcgcagcatt caagggtttt cgctcaaaac ttcaaatttc 48 0 

aggatgttgg gtttagtggg gtttaaattc caaaccgtgc tattccacca tgacgctagt 54 0 

attgaagtgg ggatcaaatg gccttttgct tttgaatacg actcaccctt tgtaaggctt 60 0 

ttttctgtct ttatttcgca cactttctac ctttaa 636

<212> Type : DNA
<211> Length : 636

SequenceName : SEQ ID 680

SequenceDescription : 

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaaaaaat ttaccctatc gctatttttg tgttgcacct tacttaacgc tgaagaggjat 6 0 

atttttagga acaacacaaa tgaaactgat cttacaaatt cttttgaaca tggcaaagaa 12 0 

aacaacaatc ttatcccagc aaaatccgat agtttagaaa gtttcaaaga acaagaaaac 18 0 

aaagaaaaag ccaaacagct tatggattta aaggccttac agagcgtgta tttttctaaa 240 
aatagaaaat tgcaagacaa taatttcaat gtcttgtatg tggcaggcaa caccaacaaa 3 00 

atccgcttac gctatgcgat gactaccacc tttatttttg ataatgatcc tattatctat 360 
gtgagtttag gagatcctag cgattttgaa ctcacttacc ccactaatga tcattacgjat 42 0 

ttgtctaaca tgctagtgat taagccgtta cttatagggg tggatacgaa cctaaccgjta 480 
gtcggagcga gcggcacaat ttatacctta ttatttgttt ag 522 
<212> Type : DNA
<211> Length : 522 

SequenceName : SEQ ID 681 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString :

gtgttggatt atgtgccctg gattgggaat gggtacaggt atgggaataa ccaccggcjgg 60 

agcaacagta gtacgagtgg ggtaacgacc cagggacaat cccaaaatgc atccagtaac 12 0 

gaacccgcac cgacgttttc gaatgtcggc gttggtctca aagccaatgt caacggca.cc 18 0 

ttaagtggtt cgcgaactac gcctaatcag caaggcactc cctggttaac cctcgaccaa 24 0 

gccaacctcc agctctgaac gggcgcgggg tgaaggaatg ataagaacgg acaaagtgjac 3 00 

gaaaactaca ccaatttcgc gagtgctaag ggcagtacga accagcaggg ttcaacaacg 36 0 

ggtggatcag cgggcaaccc cgactcgtta aagcaggata aggctgataa aagtggtgjat 42 0 

tcggtaacgg ttgcagaagc cacatcgggc gacaacttga cgaattacac caacctcccc 48 0 

ccaacatcac ccccacatcc gactgaccga acgcgctgtc attcaccaac aagaacaacg 540 

cccagcgggt gcagctgttc ctgcgcggcc tgttgggcag catcccggtg ttggtcaata 60 0 

agagtgggca agatgataac agtaagttta attccaccga ccaaaaatgg tcttacaccg 660 
aattaa 

<212> Type : DNA 
<211> Length : 666 

SequenceName : SEQ ID 682 

SequenceDescription : 



666 
<213> OrganismName : Mycoplasma pneumoniae
<400> PreSequenceString :

gtggatgata taaccgcgcc tcaaaccagc gcggggtcgt ccagcggaac tagtacgaac 60 

acaagtggtt cgcgttcctt tctcccgacg ttttcgaatg tcggcgttgg cctcaaagcg 120 

aatgtccagg gcaccctcgg gggcagacag acgacgacta cggggaacaa tattcccaaa 180 

tgagccaccc tcgaccaagc caacctccag ctctgaacgg gggcggggtg aaggaatgat 240 

10 aagactacaa gtggttcaac cggcaatgcc aatgacacca agttcacgag tgctacgggc 3 00 

agtgggagtg ggcagggcag ttcttcaggt acaaatactt ccgcggggaa tcctgatggg 3 60 

cttcaggctg ataaagttga tcaaaacggt caggtgaaaa caagcgttca agaagccact 420 

tcaggggaca acttgacgaa ttacaccaac ctcccccccg ccaacctcac ccccaccgct 48 0 

gattgaccga aogcgctgtc attcaccaac aagaacaacg .cgcagcgcgc ccagctgttc,.. 540 

15 ctgcgcggcc tgttgggcag catcccggtg ttggttaata agtccggcca agatgataac 60 0 

agtaagttta aggcggagga ccaaaaatgg tcctacaccg acttacagtc ggaccaaacc 660 

aaactgaacc tccccgctta cggcgaggtg aatgggttgt tgaatccggc gttggtggaa 72 0 

acctattttg ggaacacgcg agcgagtggt tcggggtcca acacgaccag tfccacccggt 780 

atcggtttta aaattcccga acaaagtggc acaaacacaa cgtcgaaggc tgtgctgatc 840 

20 acccccgggt tggcttgaac gccgcaagac gttggtaaca tcgttgtcag tggcaccagc 900 

ttcagcttcc agctcggcgg gtggttagtt acgttcacgg actttatcaa accccgcgct 960 

ggttacctcg ggctccagtt aacgggtctc gatgtaagtg aagcgaccca aagggagtta 102 0 

atttgggcca agcggccctg agcggccttt cgtggcagtt gggtcaaccg gctgggccgc 10 8 0 

gtggagagtg tgtgggattt caagggggtg tgggcggatc aagctcagtt ggccgcgcaa 1140 

25 gcagctacaa gtagtaccac caccaccgca acaggggcta ccttaccgga gcacccgaat 12 0 0 

gccctcgcgt accaaattag ctataccgac aaggattcgt acaaggcttc cactcaaggt 12 60 

tcgggtcaaa ccaattccca aaacaattcg ccctacctcc attttattaa acctaagaaa 13 2 0 

gtcgaaagca cgacccaact cgaccagggc ttaaaaaacc tgttggaccc caaccaggtt 13 8 0 

cgcaccaagc tgcgccaaag ctttggtaca gaccattcca cccagcccca gccccaatcg 144 0 

30 ctcaaaacaa cgacaccggt atttgggagg agtagtggta acctcagtag tgtgtttagt 1500 

ggtgggggtg ctggaggggg ttcttcaggc tcaggtcaat ctggcgtgga tctctccccc 1560 

gttgaacggg tgagtggtca ctaa 1584 
<212> Type : DNA
<211> Length : 1584

SequenceName : SEQ ID 683

SequenceDescription :



Sequence 



<213> OrganismName : Mycoplasma pneumoniae
<400> PreSequenceString : 

gtgttaaaac ttgctgttgg tatttttatt tcccccacgc tcaccaggtt tagtaccggg 60 

ttcaacctcg cggggtcggt gctcgaccag gtgttggatt atgtgccctg gattgggaat 120 

gggcacaggt atgggaataa ccaccggggc gtggatgata taaccgcgcc taaaaccggc 180 

45 gcggggtcgt ccagcggaac tagtacgaac acaagtggtt cgcgttcctt tctcccgacg 240 

ttctcgaatg tcggcgttgg cctcaaagcg aatgtccagg gcaccctggg tggcagtcag 3 00 

acgacgacta cggggaaaga tattcccaaa tgacccaccc tcgacccagc caacctccag 3 60 

ctctgaacgg gggcggggtg aaggaatgat aaggcttcaa acaaacaaag tgacgaaaac 420 

cacaccacct ttaaaagcgc tacgggcagt ggccagcagg gtggttctac aacgggtggt 480 

tcagcgggca atcccgactc gttaaagcag gataagatta gtaaatcagg tcagaactta 540

accacgcagg acggcgcgcc ccagtctaat tcgacgacgg aatccgcgtc gaattatgat 600 

cacctccccc ccaacctcac ccccacatcc gattgaccga acgcgctgtc attcaccaac 660 

aagaacaacg cgcagcgcgc ccagctcttc ctccgcggct tgttgggcag catcccggtg 720 

ttggtgaatc gaagtgggtc agatgattcc aacaaattcc aagccaccga ccaaaaatgg 780 

tcttacaccg acttaaagtc ggaccaaacc aagctcaacc tccccgctta tggtgaagtg 840

aatgggttgt tgaatccggc gttggtggaa acctattttg ggacgacgcg agcgggtggt 900 

tcggggtcca acacgaccag ttcacccggt atcggtttta aaattcccga acaaaataat 960 

gattcgaagg ctgtgctgat cacccccggg ttggcttgaa cgccgcaaga cgttggtaac 1020

ctcgttgtca gtggtaccag cttgagcttc cagttgggcg ggtggctggt caccttcacg 1080 

gactttgtca aaccccgcgc gggttacctc gggctccagt taacgggctt ggatgcaagt 1140

gatgcgacgc agcgcgccct catttgggcc aagcggccct gagcggcctt tcgtggcagt 1200

tgggtcaacc gcttgggccg ggtcgagagt gtgtgggatt taaagggggt gtgacaagat 1260

caagctcagg cggccgcgca agcagctacc accgccgccg caacagggga cgccttaccg 1320

gagcacccca atgccctcgc gtaccaaatt agctccaccg acaaggattc gtacaaggct 1380

tccactcaaa gctccggtca aaccaattcc caaaacacct ccccctacct ccatttgatt 1440

aaacctaaga aagtcgaaaa cacgacccaa ctcgaccagg gcttaaaaac ctgttggacc 1500 

ccaaccaggt tcgcaccaag ctgcgccaaa gctttggtac agaccattcc acccaagcca 1560 
aaccccaatc cctcaaaaca accacaccgg tgtttgggac gaatagtggt aacattggca 1620

gtgtgcttag tggtgggggt gctggaggag cagacagcac caattcggtg gacctctccc 1680 

ccgttgaacg ggtgagtggg tggcttgtgg ggcaattacc cagtgggggt ggggggaata 1740 

gtagtgagga tattaaaagt gtgcaagaca ctcctattta tatctatatt cattagtata 1800

ttctttttaa attgttccct cactttattc atatgaacca ccgcctccct cgcgacggga 1860

ctcaccgtag tgggacactt cacaagtacc accacgacgc tcaagcgcca gcaatttagc 1920 

tacacccgcc ctgacgaggt cgcgctgcgc cacaccaatg ccatcaaccc gcgcttaacc 1980 

ccgtgaacgt atcgtaacac gagcttttcg tccctccccc tcacgggtga aaaccccggg 2040 

gcgtgggcct tagtgcgcga caacaccgcc aagggcatca ctgccggcag tggcagtcaa 2100 

caaaccacgt atgatcccac ccgaaccgaa gcggctttga ccaccgccac cacctttgtg 2160

ttacgccgat acgacctcgc cgggcgttgt acgacctcga cttttcgaag ttaa 2214 



<212> Type : DNA 
<211> Length : 2214
SequenceName : SEQ ID 684

SequenceDescription :

Sequence 



<213> OrganismName : Mycoplasma pneumoniae
<400> PreSequenceString : 

gtgttggatt atattccctg gattgggaat gggcacagat atgggaatga ccaccggggt 60 

agcaacagta gtacgagtgg ggtaacaacc cagggacaac aatcgcaaaa tgcatccgga 120 

accgaacccg catccacttt ttcgaatgtc ggcgttggcc tcaaagcgaa tgtccagggc 180 

accctgggtg gcagtcagac gacgactacg gggaaagata ttcctaaatg acccaccctc 240

gaccaagcca acctccagct ctgaacgggg gcggggtggc ggaatgataa ggcttcaagt 300

ggacaaagtg acgaaaacca caccaagttc acgagcgcta cgggcagtgg gcagcagggc 360 

agttcttcag gtacaactaa ttccgcgggc aatcccgact cgttaaagca ggataaggtt 420 

gataaaagtg gtgattcggt aacggttgca gaaaccactt caggggacaa cttgacgaat 480 

tacaccaacc tccctcccaa cctcaccccc accgctgatt gaccgaacgc gctgtcattc 540

accaacaaga acaacgcgca gcgcgcccag ctcttcctcc gcgccctgtt gggcagcatc 600

ccggtgttgg tgaataagag tggccaagat gattccaaca agttccaagc caccgaccaa 660 

aaatggfcctt acaccgaatt aaagtcggat caaaccaagc tcaacctccc cgcttacggg 720 

gaggtgaatg ggttgttgaa tccggcgttg gtggaggtgfc acggtctgag ttccactcaa 780 

ggttctagca ccggagccgg tggcgctgga ggtaacaccg gaggcgacac caatacccag 840

acttatgcac gacccggaat cggctttaaa ttaccctcca cggactcgga atcttccaaa 900 

gccaccctga tcacccccgg gttggcttga acagcgcaag atgtcggtaa cctcgttgtc 960 

agtggtacca gcttgagctt ccagctcggc gggtggttgg ttaccttcac ggattttatc 1020 

aaaccccggt cgggttacct cgggctccag ttaacgggct tggatgcaaa tgacagcgac 1080

caaagggagt taafcttgagc ccccccggcc ctgaaccgcc tttcgtggca gttgggtcaa 1140

ccgcttgggc cgcgtggaga gtgtgtggga tttcaagggg gtgtgggcgg atcaagctcg 1200

gtccgactcg caagcagcta caagtaccac caccgcaacg aaggctacct tatcggagca 1260

caccaatgct ttggcctttc aggtgagtta taccgaccag gattcgtaca aggcttccac 1320

tcaaagctcc ggccaaaacc aaaacacctc cccctacctg cacttggtgc aggggaaaaa 1380

agtaggttcc tctgataa 1398



<212> Type : DNA 

<211> Length : 1398 

SequenceName : SEQ ID 685 
SequenceDescription : 


Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

ttgttgggca gcatcccggt gttggtgaat cgaagtgggt ccgattccaa caaattccaa 60

gccaccgacc aaaaatggtc ctacaccgac ttacagtcgg accaaaccaa actgaacctc 120 

tccgcttacg gtgaggtgaa tgggttgttg aatccggctt tggtagaaac ctattttggg 180 

acgacgcgta cgtctagcac tgcgaaccaa aacagtacaa ccgtccccgg tatcggtttt 240 

aaaattcccg aacaaaataa tgattccaaa gccaccctga tcacccccgg gttggcttga 300

acgccccagg acgtcggtaa cctcgttgtc agtggcacca cggtgagctt ccagctcggc 360

gggtggctgg tcaccttcac ggactttgtc aaaccccgcg cgggttacct cgggctccag 420 

ttaagtggcc tgaatgccag tgacagcgac caaagggagt taatttgggc cccccggccc 480 

tgagcggcct ttcgtggcag ttgggtcaac cggttgggcc gcgtggagag tgtgtgggat 540 

ttgaaggggg tgtgggcgga tcaagctcag ttggccgcgc aagcagctac aagtagtacc 600 

accaccaccg caacaggggc taccttaccg gagcacccga atgctttggc gtaccaaatt 660

agctataccg acaaggattc gtacaaggct tccactcaag gttcgggtca aaccaattcc 720 

caaaacaatt cgctctacct ccatttgatt aaacctaaga aagtcgaaag cacgacccaa 780 
ctcgaccagg gcttaaaaaa cctgttggac cccaaccagg ttcgcaccaa gctgcgccaa 840
agctttggta cagaccattc cacccagccc cagccccaat cgctcaaaac aacgacaccg 900 
gtgtttggag ccatgagtgg taacctcggc agtgtgctta gtggtggggg tgctggagga 960 
gcaggcagca ccaattcggt ggacctctcc cccgttgaac gggtgagtgg gtcactaacc 1020
attaatagga atttttctta ttaa 1044 
<212> Type : DNA 
<211> Length : 1044 

SequenceName : SEQ ID 686 

SequenceDescription :

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString :

atgggccagc agggacaatc aggtacctcc gcggggaatc cagactcgtt aaagcaggat 60 

aagattagta agagtgggga tagtttaacc acgcaggacg gcaatgcgac cggtcaacaa 120

gaggccacta actacaccaa cctccccccc aacctcaccc ccaccgctga ttgaccgaac 180

gcgctgtcat tcaccaacaa gaacaacgcg catcgcgccc agctcttcct tcgcggcttg 240 

ttgggcagca tcccggtgtt ggtgaatcga agtgggtccg attccaacaa attccaagcc 300

accgaccaaa aatggtccta caccgactta cagtcggatc aaaccaagct gaacctcccc 360

gcttacggtg aggtgaatgg gttgttgaat ccggcgttgg tggaaaccta ttttgggaac 420

acgcgagcgg gtggttcggg gtccaacacg accagttcac ccggtatcgg ttttaaaatt 480 

cccgaacaaa ataatgattc caaagccacc ctgatcaccc ccgggttggc ttgaacgccc 540 

caggacgtcg gtaacctcgt tgtcagtggc accagcttga gcttccagct cggcgggtgg 600 

ctggtcagct tcacggactt tatcaaaccc cgcgcgggtt atttgggcct ccagttaagt 660 

ggcctggatg ccagtgacag cgaccaaagg gagttaattt gggccaagcg gccctgagcg 720 

gcctttcgtg gcagttgggt caaccggctg ggccgcgtgg agagtgtgtg ggatttaaag 780 

ggggtgtggg cggatcaagc tcagttggcc gcgcaagcag ctacaagtga agcttccggg 840 

tcagctttgg cacctcaccc gaatgctttg gcgtttcagg tgagtgtggt ggaagcgagt 900 

gcttacagct cttcaacctc aagttcgggt tccgggtcaa gttcaaacac ctccccctac 960 

ctccatttga ttaaacctaa gaaagtcgaa agcacgaccc aactcgacca gggcttaaaa 1020 

aacctgttgg accccaacca ggttcgcacc aagctgcgcc aaagctttgg tacagaccat 1080

tccacccagc cccaatcgct caaaacaacg acaccggtat ttgggacgag tagtggtaac 1140 

attggcagtg tgcttagtgg tgggggtgct ggagggggtt cttcaggctc aggtcaatct 1200

ggtgtggacc tctcccccgt tgaacgggtg agtggtcact aa 1242 
<212> Type : DNA 
<211> Length : 1242 

SequenceName : SEQ ID 687 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

ttgggcctcc agttaagtgg cctggatgcc agtgacagcg accaaaggga gttaatttgg 60 

gccaagcggc cctgagcggc atttcgtggc agttgggtca accggttggg acgggtcgag 120 

agtgtgtggg atttgaaggg ggtgtgggcg gatcaagctc acagcgcggt gtccgagtcg 180 

caagcagcta caagtagtac caccaccacc gcaacagggg acaccttacc ggagcacccg 240 

aatgccctcg cgtaccaaat tagctccacc gacaaggatt cgtacaaggc ttccactcaa 300

ggctccggtc aaaccaattc ccaaaacacc tccccctacc tgcatttgat taaacctaaa 360

aaagttactg cttccgacaa gttagacgac gatcttaaaa acctgttgga ccccaacgag 420

gttcgggtga agctgcgcca aagctttggt acagaccatt ccacccaacc ccaaccccaa 480

cccctcaaaa caacgacacc ggtgtttggg acgaatagtg gtaacctcgg tagtgtgctt 540 

agtggtgggg gtaccacgca ggactcaagc accaccaatc aactgtcacc cgttcaacgg 600

gtgagtgggt ggcttgtggg gcagttacca agcacgagtg acggaaacac ctcctccacc 660 

aacaacctcg cgcctaatac taatacgggg aatgaggtgg tgggggtggg agatttgtct 720

aagcgagctt ccattgaatc aagtcgtttg tgaatagcat taaaacctta g 771 

<212> Type : DNA 
<211> Length : 771 

SequenceName : SEQ ID 688 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString :

gtgcgcgata acactgccaa gggcatcact gccggcagtg gcagtcaaca aaccacgtat 60

gatcctgcgc gaaccgaggc caccttgacc accaccacct ttgcgctgcg ccggtatgac 120 

ctcgccgggc gcgccttata cgacctcgac ttttcgaagt taaacccaca aacgccaacg 180 

cgtgatgcca actgccagat cacctttaac ccctttggcg gctttggttt gagtggcagt 240 

gcaccccaac agtgaaacga ggtcaaaaac aaggtccccg tcgaggtggc ccaagacccc 300

accgatcctt atcggtttgc cgttttactc gtgccgcgca gtgtggtgta ctatgagcag 360

ttgcaaaggg gattagcgct ccctaaccaa gggagttcgt caggctccgg tcaacaaaac 420 

accaccattg gcgcgtatgg gctgaaggtg aagaacgccg aggcggacac cgcaaagagc 480 

aatgaaaaac tccagggcga tgaatccaag tcttccaatg gatcttcaag cacttccacc 540 

accacccaac gtggttcgac caattccgac accaaagtca aggctttaaa aatagaggtg 600

aaaaagaaat cggactcgga ggacaatggt cagctgcagt tagaaaaaaa tgatctcgcc 660 

aacgctccca ttaagcgggg cgaggagtcg ggtcagtccg tccaactcaa ggcggacgat 720 

tttggtactg ccccttccag ttcgggatca ggcggcaact ccaaccccgg ttcccccacc 780

Gcctgaaggc cgtggcttgc gactgagcaa attcacaagg acctccccaa atgatccgcc 840

tcgatcctga ttctgtacga tgcgccttat gcgcgcaatc gtaccgccat tgatcgcgtt 900

gatcacttgg atcccaaggt gatgaccgcg aactatccgc ccagttgaag aatgcccaag 960 

tgaaaccacc acgggttgtg ggactgaaag gcgcgcgatg ttttgttcca aaccaccggg 1020 

tttgatgaaa gtaatacttc gaacaccaag cagggctttc aaaaggaagc tgactccgac 1080

aagtcggccc ccatcgccct cccgtttgaa gcgtacttcg ccaacattgg caacctcacc 1140 

tggttcgggc aagcgctttt ggtgtttggt ggcaatggcc atgttaccaa gtcggcccac 1200

accgcgcctt tgagtatttg gctttatata tatttagtaa aggcagtcac ttttaggttg 1260 

cttttagcta actctttatt atcaaaaagc aacatatata aaaaaacagc taattaa 1317 

<212> Type : DNA 
<211> Length : 1317

SequenceName : SEQ ID 689
SequenceDescription :






Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

gtgcgcgata acattgccaa gggcatcact gccggtagta acacccaaca aaccacgtat 60 

gatcccaccc gaaccgaggc caccttgacc accgccacca cctttgcgtt acgccggtat 120 

gacctcgccg ggcgcgcctt atacgacctc gatttttcga agttaaaccc acaaacgccc 180

acgcgcgacc aaaccggtca gatcaccttt aacccctttg gcggctttgg gctgagtggg 240 

gctgcacccc aacagtgaaa cgaggtcaag gataaggtcc ccgtcgaggt ggcccaagac 300

ccctccaatc cttatcggtt tgccgtttta ctcgtgccgc gtagcgtggt gtactatgag 360

cagttgcagc gggggttagc gctccctaac caagggagtt cgtcaggctc cggtcaacaa 420

aacaccacca ttggcgcgta tgggctgaag gtgaaaaacg ccgaggcgga caccgcgaag 480

agcaatgaaa aactccaggg ctatgaatcc aagtcttcca atggatcttc aagcacttcc 540 

accacccaac gtgggggttc gtcaaatgaa aacaaagtca aggcgttgca ggtggcggtg 600

aaaaagaaat ccgggagtca gggcaactcc ggtgaccaag gcaccgaaca ggtggaactt 660

gaatctaatg atttagccaa cgccccgatt aaacgaggct ccaataacaa ccagcaagtc 720

caactcaagg cggacgattt tggtactgcc ccttccagtt cgggatcagg cacccaagat 780

ggcaccccca ccccctgaac gccgtggtta acgactgagc aaattcacaa cgaccccgcc 840 

aaattcgccg cctcgatcct cattctgtac gatgcgcctt atgcacgcaa ccgtaccgcc 900 

attgaccgcg ttgatcactt ggatcccaag gtgatgaccg cgaactatcc gcccagttga 960 

agaacgccca agtgaaacca ccacggtttg tgggactgaa aggcgcgcga tgttttgctc 1020 

caaaccaccg ggttcttcaa cccgcgccgc caccccgagt ggtttgatgg cgggcagacg 1080

gtcgcggata acgaaaagac cgggtttgat gtggataact ccgaaaacac caagcagggc 1140 

tttcaaaagg aagctgactc cgacaagtcg gccccgatcg ccctcccgtt tgaagcgtac 1200 

ttcgccaaca ttggcaacct cacctggttc gagcaagcgc ttttggtgtt tgggatttgt 1260 

ttgtcttaa 1269 

<212> Type : DNA

<211> Length : 1269 

SequenceName : SEQ ID 690 
SequenceDescription : 

Sequence



<213> OrganismName : Mycoplasma pneumoniae
<400> PreSequenceString : 

atgctttggc ctttcaggtg agtgtggtgg aagcgagtgc ttacaagcca aacacgagct 60 

ccggccaaac ccaatccact aacagttccc cctacctgca cttggtgaag cctaagaaag 120

ttacccaatc cgacaaagtt agacgacgat cttaaaaacc tgttggaccc caacgaggtt 180

cgcgccagaa tgctcaaatc atttggtaca gaaaatttca cccaacccca accccaaccc 240



WO 2005/076010 



PCT/IN2005/000037 



310/341 



caagccctca aaacaacgac accggtattt gggacgagta gtggtaacct cggtagtgtg 300

cttagtggtg gggggtacca cgcagggctc aagcaccacc aatcaactgt cacccgttca 360

acgggtgagt gggtggaccg ctaa 384
<212> Type : DNA 
<211> Length : 384 

SequenceName : SEQ ID 691 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae
<400> PreSequenceString :

gtgcgcgaca acagcgccaa gggcatcact gccggtagtg aaagtcaaca aaccacgtat 60 

gatcccactc gaaccgaagc ggctttgacc gcatcaacca cctttgcgtt acgccggtat 120

gacctcgccg ggcgcgcctt atacgacctc gactttagta ggttaaaccc acagacacca 180 

acgcgcgacc aaaccgggca gatcaccttt aacccctttg gcggctttgg tttgagtggg 240

gctgcacccc aacagtgaaa cgaggtcaaa aacaaggtcc ccgtcgaggt ggcccaagac 300

ccctccaatc cttatcggtt tgccgtttta ctcgtgccgc gtagcgtggt gtactatgag 360

cagttgcagc gggggttagc gctccctaac caagggagtt cgtcaggctc cggtcaacaa 420 

aacaccacca ttggcgcgta tgggctgaag gtgaagaacg ccgaggcgga caccgcgaag 480

agcaatgaaa aactccaggg cgatgaatcc aagtcttcca atggatcttc aagcacttcc 540

accaccaccc aacgtggggg ttcgtcaggg gacaccaaag tcaaggcgtt gcaggtggcg 600

gtgaaaaaga aatccgggag tcagggcaac tccggtgaac aaggcaccga acaggtggaa 660 

cttgaatcta atgatctcgc caacgctccc attaagcggg gcgaggagtc gggtcagtcc 720 

gtccaactaa aggcagccga cttcggcacc accccatcca gttcgggatc aggcggcaac 780 

tccaaccccg gttcccccac cccctgaagg ccgtggcttg cgaccgagca aattcacaag 840 

gacctcccca aatgatccgc ctcgatcctg attctgtacg atgcgcctta tgcgcgcaat 900 

cgtaccgcca ttgatcgcgt tgatcacttg gatcccaagg tgatgaccgc gaactatccg 960 

cccagttgaa gaacgcccaa gtgaaaccac cacgggttgt gggactgaaa ggccagagat 1020

gttttgctcc aaaccaccgg gttcttcaac tcgcggcgcc accccgagtg gtttgaccag 1080

ggccaagcgg tcgcggataa tacccaaacc gggtttgata cagatgacac cgataataaa 1140 

aaaacaaggc tttcaaaagg aagctgactc cgacaagccg gccccgatcg ccctcccgtt 1200 

tgaagcgtac ttcgccaaca ttggcaacct cacctggttc gggcaagcgc ttttggtgtt 1260

tgggatttgt ttgtcttaat taactaa 1287
<212> Type : DNA
<211> Length : 1287

SequenceName : SEQ ID 692
SequenceDescription : 



Sequence 



<213> OrganismName : Mycoplasma pneumoniae
<400> PreSequenceString :

atgtttggct tgaaggtgaa gaacgccgag gcggacaccg cgaagagcaa tgaaaaactc 60 

cagggcgctg aggccactgg ttcttcaacc acatctggat ctggccaatc cacccaacgt 120

gggggttcgt caggggacac caaagtcaag gcgttgcagg tggcggtgaa aaagaaatcc 180

gggagtcagg gcaactccgg tgaccaaggc accgaacagg tggaacttga atctaatgat 240 

ttagccaacg ccccgattaa acggggctcc aatccagcaa gtccaactca aggcagccga 300

cttcggcacc accccatcca gttcggaatc tggtcaatca ggcaccccca ccccctgaag 360

gccgtggctt gcgaccgagc aaattcacaa ggacctcccc aaatgatccg cctcgatcct 420

cattctgtac gatgcgcctt atgcctttaa 450 
<212> Type : DNA 
<211> Length : 450 

SequenceName : SEQ ID 693 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae
<400> PreSequenceString : 

atgtttggct tgaaggtgaa ggatgcaacc gtggatagtt cgaagcaatc aacggaaagc 60 

ttaaagggcg aagaatcgag ttccagttcc accacatctt ccacctccac cacccaacgt 120

gggggttcgt caggggacac caaagtcaag gcgttgcagg tggcggtgaa aaagaaatcg 180

gactcggagg acaatggtca gatcgaactt gaaaccaaca acctcgccaa cgccccgatt 240

aaacggggct ccaataacaa ccagcaagtc caactcaagg cggacgattt tggtacttcc 300

ccttccagtt cggaatctgg tcaatcaggc acccccaccc cctgaacgcc gtggcttgcg 360 

accgagcaaa ttcacaagga cctccccaaa tgatccgcct cgatcctgat tctgtacgat 420 
gcgccttatg cgcgcaatcg taccgccatt gatcgcgttg atcacttgga tcccaaggtg 480

atgaccgcga actatccgcc cagttgaaga acgcccaagt gaaaccacca cgggttgtgg 540 
gactgaaagg cgcgcgatgt cctggtccaa accaccgggt tcttcaaccc gcgccgccac 600

cccgattggt ttgaccaggg ccaagcggtc gcggagaata cccaaaccgg gtttgataca 660 
gatgacaccg ataataaaaa gcagggcttt cgcaaacaag gcgaacaatc ccctgccccc 720 
atcgccctcc cgtttgaagc gtacttcgcc aacattggca acctcacctg gttcgggcaa 780 
gcgcttttgg tgtttgggat ttgtttgtct taa 813 
<212> Type : DNA 
<211> Length : 813 

SequenceName : SEQ ID 694 

SequenceDescription : 



Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString :

atgggcagtc aaaaccaggg ttctacaaca actacctcag cgggcaatcc cgattcttta 60 

gttactgata aagttgatca aaaaggtcag gtgcaaacaa gtggtcaaaa tttaagtgat 120 

accaactaca ccaacctctc ccccaacttc acccccacat ccgactgacc gaacgcgctc 180 

agcttcacca acaagaacaa cgcgcagcgc gcccagctgt tcctgcacgg cttgttgggc 240 

agcatcccgg tgttggttaa taagagtggc gaaaataacg aaaaattcca agccaccgac 300

caaaaatggt cctacaccga attaaagtcg gatcaaacca agctcaacct ccccgcttat 360

ggtgaggtga atgggctgtt gaatccggcg ttggtggaaa cctattttgg gacgacgcgt 420

acgtctagca ctgcgaacca aaacagtaca accgtccccg gtatcggttt taaaattccc 480 

gaacaaaata atgattcaaa ggctgtgctg atcacccccg ggttggcttg aacgccccag 540 

gacgttggta acctcgttgt cagtggcacc agcttcagct tccagctcgg cgggtggctg 600 

gtcagcttca cggactttgt caaaccccgc gcgggttacc tcggactcca gttaacgggc 660 

ttggatgcaa gtgatgcgac gcagcgcgcc ctcatttggg cccccccggc cctgagcggc 720 

ctttcgtggc agttgggtca accggttggg ccgcgtggag agtgtgtggg atttgaaggg 780 

9<?tgtgggcg gatcaagctc agtccgactc gcaaggatct accaccaccg caacaggggc 840 

taccttaccg gagcacccga atgctttggc ctttcaggtg agtgtggtgg aagcgagtgc 900 

ttacaagcca aacacgagct ccggccaaac ccaatccact aa 942 
<212> Type : DNA 
<211> Length : 942

SequenceName : SEQ ID 695

SequenceDescription : 



Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString :

atgagttttg gtttagtggg cactgttaat aataacggtt ggaaatcgcc gtttcgacat 60 

gaaacaaaat accgagcagg gtatgataag ttcaagtatt acaaaaccca ctaccggggt 120 

gcaaaaaaag ctggaaccaa tgatgatcga tggagatgaa ctgcttggtt tgaccttgac 180 

tttgcccacc aaaagatagt gctcattgaa aggggtgaac ttcaccgtca agcagattta 240 

aaaaaatctg accccgcaac aaacgaaact tccaaaaccg tttggggtag cattaaagaa 300

aagctgttac aaaacgtcaa caacctccac tcagagaaag gagtattctt atggttccgt 360

caatctggtt ttacaactac tagaaactag 390
<212> Type : DNA 
<211> Length : 390 

SequenceName : SEQ ID 696 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atggctgaac cgttggccgt cgatcccacc ggcttgagcg cagcggccgc gaaattggcc 60 

ggcctcgttt ttccgcagcc tccggcgccg atcgcggtca gcggaacgga ttcggtggta 120 

gcagcaatca acgagaccat gccaagcatc gaatcgctgg tcagtgacgg gctgcccggc 180 

gtgaaagccg ccctgactcg aacagcatcc aacatgaacg cggcggcgga cgtctatgcg 240 

aagaccgatc agtcactggg aaccagtttg agccagtatg cattcggctc gtcgggcgaa 300

ggcctggctg gcgtcgcctc ggtcggtggt cagccaagtc aggctaccca gctgctgagc 360

acacccgtgt cacaggtcac gacccagctc ggcgagacgg ccgctgagct ggcaccccgt 420 

gttgttgcga cggtgccgca actcgttcag ctggctccgc acgccgttca gatgtcgcaa 480 

aacgcatccc ccatcgctca gacgatcagt caaaccgccc aacaggccgc ccagagcgcg 540 

cagggcggca gcggcccaat gcccgcacag cttgccagcg ctgaaaaacc ggccaccgag 600 
caagcggagc cggtccacga agtgacaaac gacgatcagg gcgaccaggg cg&cgtgcag 660

ccggccgagg tcgttgccgc ggcacgtgac gaaggcgccg gcgcatcacc ggcgccagcag 720

cccggcgggg gcgttcccgc gcaagccatg gataccggag ccggtgcccg cccragcggcg 780 

agtccgctgg cggcccccgt cgatccgtcg actccggcac cctcaacaac cacaacgttg 840 

tag 843



<212> Type : DNA 

<211> Length : 843 

SequenceName : SEQ ID 697 
SequenceDescription : 


Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString :

atgcgttatc tgatagcgac cgcagtgctc gttgctgtgg tcctggtggg ctcggccggcg 60

gctggtgcgc cgccgtcatg cgccggcctg ggcggcactg tgcaggccgg ccagatctgc 120 

catgtgcacg cctcgggccc taagtacatg ctggatatga catttcctgt cgactatccc 180 

gaccagcagg cgctgaccga ctacatcacg caaaaccgcg acgggttcgt caacgtcgcg 240

caggggtccc cgctgcgaga ccagccctac caaatggacg ccaccagcga acagcacagc 300

tccggccagc cgccgcaggc cacccgcagc gtagtgctca aattcttcca ggacctcggt 360

ggggcacatc cgtccacctg gtacaaggcc ttcaactaca acctcgcgac ctcgcagccc 420

atcaccttcg acacgttgtt cgtgcccggc accacgccac tggacagcat ctaccccatc 480

gttcagcgcg agctggcacg tcagaccggt ttcggtgccg cgatattgcc ttcgaccggc 540

ctcgacccgg ctcactacca gaactttgct atcaccgacg acagtctgat tt~t:ctacttc 600 

gcccagggtg agctgctgcc gtcgtttgtc ggcgcttgcc aagcccaggt gccgcgcagc 660

gccattccgc cgctggcaat ctaa 684 
<212> Type : DNA 
<211> Length : 684 

SequenceName : SEQ ID 698 



SequenceDescription :

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv

<400> PreSequenceString :

atgaagatgg tgaaatcgat cgccgcaggt ctgaccgccg cggctgcaat cgx^cgccgct 60 

gcggccggtg tgacttcgat catggctggc ggcccggtcg tataccagat gcragccggtc 120 

gtcttcggcg cgccactgcc gttggacccg gcatccgccc ctgacgtccc gaccgccgcc 180

cagttgacca gcctgctcaa cagcctcgcc gatcccaacg tgtcgtttgc ga_a,caagggc 240 

agtctggtcg agggcggcat cgggggcacc gaggcgcgca tcgccgacca ca^agctgaag 300

aaggccgccg agcacgggga tctgccgctg tcgttcagcg tgacgaacat ccragccggcg 360

gccgccggtt cggccaccgc cgacgtttcc gtctcgggtc cgaagctctc gtcgccggtc 420

acgcagaacg tcacgttcgt gaatcaaggc ggctggatgc tgtcacgcgc atcggcgatg 480

gagttgctgc aggccgcagg gaactga 507 

<212> Type : DNA

<211> Length : 507 

SequenceName : SEQ ID 699 
SequenceDescription : 

Sequence



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgacctact cgccgggtaa ccccggatac ccgcaagcgc agcccgcagg ctcctacgga 60 

ggcgtcacac cctcgttcgc ccacgccgat gagggtgcga gcaagctacc ga_tgtacctg 120

aacatcgcgg tggcagtgct cggcctggct gcgtacttcg ccagcttcgg ccrcaatgttc 180

accctcagta ccgaactcgg cggaggtgat ggcgcagtgt ccggtgacac tgjggctgccg 240 

gtcggggtgg ctctgctggc tgcgctgctt gccggggtgg ctctggtgcc ta.aggccaag 300

agccatgtga cggtagttgc ggtgctcggg gtactcggcg tatttctgat gcjtctcggcg 360

acgtttaaca agcccagcgc ctattcgacc ggttgggcat tgtgggttgt gttggctttc 420

atcgtgttcc aggcggttgc ggcagtcctg gcgctcttgg tggagaccgg cgjctatcacc 480

gcgccggcgc cgcggcccaa gttcgacccg tatggacagt acgggcggta cg-ggcagtac 540

gggcagtacg gggtgcagcc gggtgggtac tacggtcagc agggtgctca gcraggccgcg 600 

ggactgcagt cgcccggccc gcagcagtct ccgcagcctc ccggatatgg gtcgcagtac 660 

ggcggctatt cgtccagtcc gagccaatcg ggcagtggat acactgctca gcrccccggcc 720

cagccgccgg cgcagtccgg gtcgcaacaa tcgcaccagg gcccatccac gccacctacc 780

ggctttccga gcttcagccc gccgccaccg gtcagtgccg ggacggggtc gc=aggctggt 840

tcggctccag tcaactattc aaaccccagc gggggcgagc agtcgtcgtc ccccgggggg 900

gcgccggtct aa 912

<212> Type : DNA 
<211> Length : 912 

SequenceName : SEQ ID 700 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString :

atgaaatgtc caggcgtctc cgactgcgtt gcgaccgtaa ggcacgataa cgtgtttgct 60 

attgctgctg gtttgcgttg gtcggccgct gtaccgccgc tacacaaagg ggacgctgtg 120 

accaaactgc tcgtcggggc catcgcgggc ggaatgctag cttgcgcagc fcatattgggc. 180 

gacggaatcg cttcggccga tactgcgttg atagtacccg gtaccgcacc gtccccgtac 240 

gggccactca ggtcgctcta tcatttcaat cccgcgatgc agcctcagat cggcgcgaat 300

tactacaacc ccaccgctac ccgccacgtc gtttcatatc caggcagctt ttggcctgtc 360

acaggcttga attcgcccac cgtcggcagt tctgtcagtg ccgggacgaa caatctcgat 420 

gcggcgatcc gcagcactga cggaccaatc ttcgtggccg ggttatcaca gggcacgctc 480 

gtgcttgacc gcgagcaggc acggttagcg aatgacccga cggctcctcc ccctgggcaa 540 

ctcacattca tcaaggccgg cgaccctaac aatcttcttt ggcgggcgtt taggccggga 600

acccacgtgc cgatcatcga ctacaccgtt ccggccccag cggaaagcca gtacgacaca 660 

atcaatatcg tgggccagta cgacattttt tctgacccgc ctaatcgtcc gggcaaccta 720 

ctcgctgacc tcaatgcgat tgccgcgggc ggatactacg gccacagcgc caccgcattc 780 

tcggacccag ctcgcgttgc gcctagggac attacgacga caacgaacag tttgggtgcg 840 

acgaccacga cctacttcat ccggaccgat cagctacctc tggtgcgggc gctggtggac 900 

atggcgggcc tgcccccgca ggcggcggga acagttgatg ccgcactgcg gcccataatt 960 

gacagggctt atcagcccgg accagcaccc gctgtgaacc cgcgtgattt ggtccagggc 1020

atccgcggta tccccgccat cgcccctgcc atcgccatcc ctatcggcag caccaccggg 1080 

gccagtgccg ccaccagcac cgctgccgcc acggcagcag caacaaatgc gctccgcggg 1140 

gccaacgtgg gcccgggcgc caacaaggcg ttgtcgatgg tccggggttt gctacccaaa 1200 

gggaagaagc actag 1215
<212> Type : DNA
<211> Length : 1215

SequenceName : SEQ ID 701 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

gtgagcctgc taccaaccct gcagtccttc ctaccgccgc ccttcgacgc cattccgaac 60 

cccatcgagg atcttgacgt tctcgtagcc gcagctgtag ccgtcgcggc tggcagtttg 120

ggggtatcgg cggcgcagct cggcgagatc taccgccacg acgtggtcga cgaggcccaa 180 

aaggcgccgc attgcccagc cgaatccgac cagacacccg ctggggccgc gggtgacggg 240 

gatctccctg aggtcggagg acgggtcacc agcccgccac agccgccggt cgccgcgctc 300 

accggctact ccgctaacat cggcggactc tccgtgccgc acagctggaa tcttccgcca 360 

gcggtgcgcc aagttgcggc gatgttcccc ggcgcgactc cgatgtatat gacggggagt 420 

tcggacggct cctacgccgg cctggcagcg gcgggtttgg ccggcaccgg tctggccggt 480

cttgccgccc gcggtggctc cgcgccgacc ccggctgcag ccgccccggc cggagccggc 540 

ggagccggcc cggctgccac caggcccgcc gcccagcaga cgcccgcggt ccccgcggcg 600 

gccgccgggt cagccatacc tggcctaccg cccggtttgc cgcccggcgt ggttgccaac 660 

cttgcggcga ccctggcggc gatccccgga gcgaccatca tcgtggtacc gccgtccccg 720

aacgccaatc aatag 735 
<212> Type : DNA 
<211> Length : 735 

SequenceName : SEQ ID 702 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atggacgtcg ctttgggggt tgcggtcacg gatcgggtcg cgcgtctggc gctggtcgac 60 

tcggctgcgc ccggcaccgt gatcgaccag ttcgtgctcg atgtggccga gcacccggtc 120 

gaggtgttaa ccgagaccgt ggtgggcacg gatcggtcat tggccggcga aaaccaccgg 180 

ctggtcgcta cccggctgtg ttggccggat caggccaaag ctgacgagct gcagcacgca 240 




ctgcaggact ccggggtcca cgacgttgcc gtgatatccg aggcgcaggc cgccacggcg 300

ctggtcgggg cggcacatgc cggctctgcc gtgctgttgg tgggtgatga gacggcaacc 360

ttatcggtgg ttggtgaccc ggacgcgccg ccgacgatgg tggccgtcgc gccggtggcg 420 

ggcgccgacg ccacatcgac cgtcgatacc ctgatggccc ggctcggcga ccaggccctc 480

gccccggggg atgtcttcct ggtgggtagg tccgccgagc acaccacggt tcttgccgac 540 

cagctgcgcg cggcgtcgac gatgcgcgtg cagactcccg acgaccccac gttcgcgctg 600 

gcccgtggcg cggcgatggc ggccggcgcc gctacgatgg cgcacccggc cctggtcgcg 660 

gatgcgacca cttcgctccc ccgggccgag gcggggcaat cgggttctga aggcgagcag 720

ctggcgtact cgcaggccag cgattacgag ctgcttccgg tcgacgaata tgaggaacac 780 

gacgaatacg gggcagccgc ggatcgctcg gcgccgttga gccgacggtc gctgctgatc 840 

ggcaacgctg tcgtggcctt tgcggtgatc ggtttcgcct cgctggcggt ggcggtggcg 900

gtcaccatcc gaccgaccgc ggcctcaaaa ccggtagagg gacaccaaaa cgcccagcca 960 

gggaagttca tgccgttgtt gccgacgcaa cagcaggcgc cggtcccgcc gcctccgccc 1020 

gatgatqcca ccgctggatt ccagggcggc accattccgg ctgtacagaa cgtggtgccg 1080

cggccgggta cctcacccgg ggtgggtggg acgccggctt cgcctgcgcc ggaagcgccg 1140

gccgtgcccg gtgttgtgcc tgccccggtg ccaatcccgg tcccgatcat cattcccccg 1200

ttcccgggtt ggcagcctgg aatgccgacc atccccaccg caccgccgac gacgccggtg 1260

accacgtcgg cgacgacgcc gccgaccacg ccgccgacca cgccggtgac cacgccgcca 1320

acgacgccgc cgaccacgcc ggtgaccacg ccgccaacga cgccgccgac cacgccggtg 1380

accacgccac caacgaccgt cgccccgacg accgtcgccc cgacgacggt cgctccgacc 1440

accgtcgccc cgaccacggfc cgctccagcc accgccacgc cgacgaccgt cgctccgcag 1500

ccgacgcagc agcccacgca acaaccaacc caacagatgc caacccagca gcagaccgtg 1560 

gccccgcaga cggtggcgcc ggctccgcag ccgccgtccg gtggccgcaa cggcagcggc 1620 

gggggcgact tattcggcgg gttctga 1647 
<212> Type : DNA 
<211> Length : 1647 

SequenceName : SEQ ID 703 
SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

ttgaagaacg cccgtacgac gctcatcgcc gccgcgattg ccgggacgtt ggtgaccacg 60 

tcaccagccg gtatcgccaa tgccgacgac gcgggcttgg acccaaacgc cgcagccggc 120

ccggatgccg tgggctttga cccgaacctg ccgccggccc cggacgctgc acccgtcgat 180 

actccgccgg ctccggagga cgcgggcttt gatcccaacc tccccccgcc gctggccccg 240

gacttcctgt ccccgcctgc ggaggaagcg cctcccgtgc ccgtggccta cagcgtgaac 300

tgggacgcga tcgcgcagtg cgagtccggt ggaaactggt cgatcaacac cggtaacggt 360

tactacggcg gcctgcggtt caccgccggc acctggcgtg ccaacggtgg ctcggggtcc 420 

gcggccaacg cgagccggga ggagcagatc cgggtggctg agaacgtgct gcgttcgcag 480

ggtatccgcg cctggccggt ctgcggccgc cgcggctga 519 
<212> Type : DNA 
<211> Length : 519 

SequenceName : SEQ ID 704 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

atgacgcggc tgataccggg ttgcacgctc gtcgggctga tgctgacgtt actgcccgcg 60 

cccacctcgg cggccgggag caacaccgcc accaccctgt tcccggtcga cgaggtcacc 120

cagctggaga cgcacacctt cctcgattgc caccccaacg gcagctgcga cttcgtcgct 180 

ggagcaaatc tgcgcacacc cgacggcccg acgggctttc cgcccgggct gtgggcgcgc 240

caaaccaccg agatccgttc gacgaaccgg ttggcctatc tggacgcgca cgccaccagc 300

cagttcgaac gggtaatgaa ggcgggcgga tccgacgtga tcaccaccgt ctacttcggc 360

gagggtccgc cggacaaata ccagaccacc ggggtcatcg actcgaccaa ttggtcgacc 420 

ggtcaaccga tgaccgacgt caacgtcatc gtgtgtacac acatgcaggt ggtctacccg 480

ggggtcaacc tcacctcgcc cagcacctgc gcgcaagcca acttttccta g 531 

<212> Type : DNA 
<211> Length : 531 

SequenceName : SEQ ID 705 

SequenceDescription : 



Sequence 
<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

atgacaccgg gtttgcttac tactgcgggt gctggccgac cacgtgacag gtgcgccagg 60 
atcgtatgca cggtgttcat cgaaaccgcc gttgtcgcga ccatgtttgt cgcgttgttg 120
ggtctgtcca ccatcagctc gaaagccgac gacatcgatt gggacgccat cgcgcaatgc 180

gaatccggcg gcaattgggc ggccaacacc ggtaacgggt tatacggtgg tctgcagatc 240 
agccaggcga cgtgggattc caacggtggt gtcgggtcgc cggcggccgc gagtccccag 300

caacagatcg aggtcgcaga caacattatg aaaacccaag gcccgggtgc gtggccgaaa 360

tgtagttctt gtagtcaggg agacgcaccg ctgggctcgc tcacccacat cctgacgttc 420
ctcgcggccg agactggagg ttgttcgggg agcagggacg attga 465 
<212> Type : DNA 
<211> Length : 465 

SequenceName : SEQ ID 706

SequenceDescription :

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 

<400> PreSequenceString :

atgatgcaac aagcggtgtc gggcattacc ggcgcgctcg gcggcgcggt cggcggcgtc 60 

atgggcccac tcacgcagct tccccagcag gccatgcaag cggggcaggg agcaatgcag 120 

ccgctgatga gtgcgcttca acagacctat ggcgcggagg gactggacgt cgcggacggg 180

gcgcggctgg tggacagcat cgaaggtgag cccggcctcg gcggcgagcc gggcgctggt 240

gacgtcggcg ccggcggcgg gggtggtggc accaccccga cgggctatct gggtccccca 300

cccgtgccga cgtcgtcgcc accgacgact ccagccgggg cgccggccaa gtcggtgacg 360

ccggacccgg ttagtggcac cccgcgggcg tcggggccgg ccggcatgac cggcatgccg 420 

atggtgccgc cgggcgcgtt gggtgcgggc gcggaaggag ccaataagga caagccggtc 480 

gagaagcggg tgacgggctg tgccgaatgg tcaaccggtc aagggccgct taacagtacc 540 

gccgagtgtt ccggtgaaat ctgccgacga caagccggtg gtcaccaagt cgacgcgacg 600

gatccttgtt gtgccgaacg acgacaaggt taa 633 
<212> Type : DNA 
<211> Length : 633 

SequenceName : SEQ ID 707

SequenceDescription :

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 

<400> PreSequenceString :

atgattcgcg aactggtcac caccgctgcg atcacgggtg ccgcgatcgg tggggcgcca 60 
gtcgcgggcg cagacccgca gcgttatgac ggcgatgtgc cggggatgaa ctatgacgct 120 
tcgctgggcg ccccatgctc cagctgggag cgcttcattt ttggacgagg cccctccggt 180

caggccgaag cctgtcattt tccgcctcct aaccagttcc cgccggccga aaccggctac 240 

tgggtgatct cctacccgct atacggcgtc cagcaggtcg gtgcgccgtg tccgaagccg 300

caggcggccg cgcagtctcc ggatgggttg ccgatgctgt gtctgggagc ccgtggatgg 360
cagccgggat ggtttaccgg ggccgggttc ttccctccgg agccataa 408

<212> Type : DNA 
<211> Length : 408 

SequenceName : SEQ ID 708

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

atgaaaacca caggcacaac tatcaaactc ggcatcgtct ggttggtgct gtcggtgttc 60 

accgtgatga tcatcgtggt gttcgggcag gtgcggttcc atcacaccac cgggtactcc 120 

gcggtgttca cccatgtcag cgggctgcgg gccgggcaat ttgtccgcgc tgcgggcgta 180

gaggtcggca aggtcgccaa ggtaacgctg atcgacgggg acaagcaagt attggtggac 240

ttcaccgtgg atcgctcgct gtcactggat caggcgacga ccgcctcgat ccgctacctc 300 

aacctgatcg gcgaccggta ccttgagctc ggccgcggtc acagcggtca gcggctggcg 360 

ccgggtgcca cgatcccgct cgagcacacc catccggcct tggatctcga cgctctgctc 420 

ggcgggtttc gcccactctt ccaaacgttg gacccagaca aggtcaacag catcgcctcc 480 

tcgatcatca ccgtgttcca agggcaaggc gccaccatca acgacatcct cgaccagacc 540

gcctcgctga cggcaacgct ggccgaccgg gaccatgcga taggtgaggt cgtcaacaac 600 

ttgaacaccg tgctggccac caccgtcaag catcaaacgg aattcgaccg cacggtcgac 660 
aagctagagg tgctgatcac tggactgaag aacagggcgg acccgctggc cgcggcggcg 720

gcacacatca gcagcgccgc gggaacccta gccgacctgc tggggcggat cgtccattgc 780

tgcacagcag cttcgggcac ctcgagggca tccagcagcc gctcatag 828 
<212> Type : DNA 
<211> Length : 828

SequenceName : SEQ ID 709 

SequenceDescription :

Sequence 


<213> OrganismName : Mycobacterium tuberculosis H37Rv

<400> PreSequenceString : 

atgactccac gcagccttgt tcgcatcgtt 

gtgagcgcac ccgccggcgg tcgtgccgcg
gttttcgctc gcggcacgca tcaggcttct

gactcgctta cctcgcaagt tggcgggcgg 

gcaagcgacg actaccgcgc gagcgcgtca 

cagcgcaccg tcgccagctg cccgaacacc 

gcgacggtca tcgatttgtc cacctcggcg 
gctgtcgccc ttttcggcga gccatccagt

tcgttgccga caatcggtcc gctgtatagc 

gatccaatat gcaccggagg cggcaatatt 

atgacaagcc aggcggcgac attcgcggcg 

<212> Type : DNA

<211> Length : 654 

SequenceName : SEQ ID 710 
SequenceDescription : 

Sequence



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

gtgataagca ccacaagaat tgatttccta tggatattgt cggtagcgtt cgcgtccatg 60 

attgctcttg caacgctgtt gacgcttatc aatcaagtcg tcggcactcc gtatattccc 120

ggtggcgatt ctcccgccgg gaccgactgc tcggagctgg cttcgtgggt atcgaatgcg 180 

gcgacggcca ggccggtttt cggagatagg ttcaacaccg gcaacgagga agccgccttg 240 

gcggctcggg gctttcaaca gggaaccgcc cccaatgcct tggtgatcgg ttggaatggc 300

caccacacgg cggtgacgct gcccgatggc acgcccgtat ccagtggtga aggcggtggc 360 

gtgcgggtcg gtggcggtgg cgcctaccag cccaaattca cccaccacat gtatctgccg 420

atggatgtgg acgcgggaga agaccagccg ccggcgccag atgagccggt caccgcggtc 480

gacgacgtgg aaccggaaat gcctgcaccg tgcccgaccc agcgcccgcc ggtgaccccg 540 

agacataacc tgtgcaacaa actccggact atgccagggg cgctctcggc cgcgctggcc 600 

gcggcggcgc cggtctggcc ggcccctata agcggctgcc gcgggttcag cacgtccctc 660 

ttagcaaaaa gaaatcaccc agtaatcgtc gggaaatag 699



<212> Type : DNA 

<211> Length : 699 

SequenceName : SEQ ID 711 
SequenceDescription : 


Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

atgacgacga tgattactct tcggcgacgg ttcgcggtgg ccgtcgccgg cgtcgccact 60
gccgccgcga cgaccgtcac cctggctccc gcaccagcaa atgccgccga tgtctatggc 120
gcaattgcct actccggcaa cggctcgtgg ggccgatcgt gggactaccc aacccgggcg 180

gctgccgaag ccaccgccgt caagtcgtgt ggctactccg actgcaaggt gctcaccagt 240 
ttcaccgcct gcggcgccgt cgccgccaac gatagggcat accagggagg agttggaccc 300

accttggccg ccgccatgaa ggacgccctg accaagctcg gcggcggcta catcgacacc 360
tgggcctgca actaa 375 
<212> Type : DNA 
<211> Length : 375 

SequenceName : SEQ ID 712 

SequenceDescription :



ggtgtcgtgg ttgcgacgac cttggcgctg 60 

catgcggatc cgtgttcgga catcgcggtc 120

ggtcttggcg acgtcggtga ggcgttcgtc 180

tcgattgggg tctacgcggt gaactaccca 240

aacggttccg atgatgcgag cgcccacatc 300

aggattgtgc ttggtggcta ttcgcagggt 360

atgccgcccg cggtggcaga tcatgtcgcc 420

ggtttctcca gcatgttgtg gggcggcggg 480 

tctaagacca taaacttgtg tgctcccgac 540 

atggcgcatg tttcgtatgt tcagtcgggg 600 

aacaggctcg atcacgccgg atga 654 



Sequence 
<213> OrganismName : Mycobacterium tuberculosis H37Rv
<400> PreSequenceString : 

atggccggac tgaacattta cgtgaggcgc tggcggacag cgcttcacgc aaccgtgtcg 60 
gcattgatag ttgccatcct cggactcgcc atcaccccgg tcgctagtgc ggcgacggcc 120

agggcgacgt tgtcggtgac atcgacgtgg cagaccggtt tcatcgcccg cttcaccatc 180

acaaactcga gcacggcgcc gctaaccgat tggaagcttg aattcgactt gccggcagga 240 
gaatccgtct tgcacacatg gaatagcacc gttgcacgat ctggcacgca ctacgttctc 300

agcccagcga attggaatcg catcattgcc cccggtggtt cagccacggg cggcctaaga 360

ggcgggctga ccggttctta ctcgccgccg tcgagttgtc tgctcaacgg gcaatatcct 420

tgcacctag 429 
<212> Type : DNA 
<211> Length : 429 

SequenceName : SEQ ID 713

SequenceDescription :

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv

<400> PreSequenceString :

atgctgactc gcgctatcaa gacccagctg gtgttgttga cggtgttggc ggtcatcgcg 60 

gtggtggtcc ttggttggta tttcctgcgg atacccagcc tggtcggcat cggtcgatac 120

acgctttatg ccgaattgcc tcggtccggg ggtctatacc gaacagccaa cgtcacatat 180

cggggcatca ccatagggaa ggtcaccggc gtcgaaccaa ccgagcgggg cgcgcgagca 240

accatgagca tcgacaatgg ctaccagatc cccaccgacg cctcggccaa tgtgcactca 300

gtgtcggcgg tcggcgagca gttcgttgac ctggtgtcga cccgcaccag cggtccgtat 360 

ctgcggcatg ggcagacgat caccacgact acggtcccca gccagattgg cccggcgctg 420

gacgccgcca accgtggatt ggcagtgctg cccaaagacc gggtcgcgtc ggtgctgcac 480

gaggcgtcgg aggccgtggg cgggctggga tcctcactga atcgcctcat cgaagccacc 540 

caggcaatcg cccacgatgt caggggcagc ctcgaggaca tcgacgacat catcgagcgt 600

tcggcgccta tcatcgatag ccaggtcaat tccggcaacg agatcgcccg ctgggccgcc 660 

aacctcaaca cgctggccgc tcagaccgcg cagaccgatc cggcggtgcg aagcattctg 720 

gccaacgcgg caccgactgc cgatcaggtc aacgccacgt tcagcgacgt gcgggagtcg 780 

ttgccgcaga cgctggccaa tctcgaggtc gtaatcgata tgctcaagcg ctaccacaac 840 

ggcgtcgagc aggcgttggt gttcttgccg cagtccggcg cgatcgccca gtcggttact 900

acagagttcc ccggccaggc cggactgggt gtcggcggcc tggcgctcaa ccaaccaccg 960 

ccgtgcctga ccggcttcct gccggcgtcg gagtggcggt cacctgctga caccagcacc 1020

gcaccgctac ccaagggcac ctactgcagg attccgatgg acgcgagcaa tgtggttcgt 1080 

ggagcacgca acaacccgtg tgtagacgtg cccggcaagc gggcggcgac cccgcgggaa 1140 

tgccgcagca atgaagctta tgtgcccggg ggcaccaatc cctggtatgg ggaccccaac 1200

cagatgctca gctgtcccgc gccggccgcg cgttgtgacc agccggtgaa gccaggccag 1260 

gtgatcccgg cgccgtcagt taacaatggc atcaacccgc tgcccgccga tcagctgcca 1320

ggcacacctc caccggtcaa cgatcctttg cagcgacctg ggtcaggcac cgtccagtgc 1380

aatgggcaac aacccaaccc gtgcgtctac accccgagca catttcctac aaccatttac 1440 

gacgtgcaga gcggcaaagt cgtagcaccc gacggtgtgg tgtattccgt tgaggcttcg 1500

actcatgccg gagccgacgg atggaaggtg atgctggcac caaccggctg a 1551

<212> Type : DNA 
<211> Length : 1551 
SequenceName : SEQ ID 714

SequenceDescription : 

Sequence 

<213> OrganismName : Rickettsia prowazekii strain Madrid E
<400> PreSequenceString : 

atgttaaata atacacaatt tttaaatttg atgaaatcct atatgaaacc agaattttat 60 

atgagttcta taaaaaatac cactaatcta gatctttcat ctatcaccaa tacaattcaa 120 

aaagccatga atattttttt taccactaac aaaatttcca cagaaagtat gcaatctttg 180

tttaagaaaa attccgagat tatacaaaat aatattaata ctattttaaa tagtactaaa 240

gaagtaataa attctaaaga ttttaaacaa gctactgaat atcatcaaaa atgtgtaaaa 300

tctatttatg aaacatctat ggacaatgct aaggaattag caaatattgc ttatgaagct 360 

tcaaataaaa tatttgaagc cgcaaataaa catattacca agaatattca taatgcttct 420 

aataatatac ataatactgc agaacaagta caaaaaaact ttaataacaa atctgcttaa 480 






<212> Type : DNA 
<211> Length : 480 
SequenceName : SEQ ID 715
SequenceDescription : 

Sequence 

<213> OrganismName : Rickettsia prowazekii strain Madrid E 
<400> PreSequenceString : 

atgaatatta aattagttac atatttttta atattagtaa gctcattaaa agtaaatgct 60 

gatttaaatc atattcaaga tagttttaaa tatcaagaag cagagcagtt aacaatagaa 120 

ttaccttgga atgactgtac tgcaattcat aaattcttag aagaaaagtt atttttttca 180

gaacaacaaa taaaaaaaga aaataaaatt catgagaaat ataagcaatt ttatttacaa 240 

cataataata agctttctga tttttctatg caatttctag aaaaaaaatc tgaaattaat 300

agtgtcgaaa ctttaatatc aggcttttta aaattttgtg aagataattt tcaaacaagt 360

aaaagtaaat cgcattcttt aaattttttc caaaaacaac aagaccaatg gttacataat 420

ataagaaatg agaattataa aacatattat aagaagaaat atgaagacaa tacctttaga 480 

aatattaatt aa 492 
<212> Type : DNA 
<211> Length : 492 

SequenceName : SEQ ID 716 

SequenceDescription : 

Sequence 



<213> OrganismName : Rickettsia prowazekii strain Madrid E 
<400> PreSequenceString :

atgaaaaagt tacttttaat tgctactgca agtgcaacaa ttttatcttc tagtgtatcg 60 

tttgcagagt gcattgataa tgaatggtat ttaagagcag atgcaggtgt agcaatgttt 120

aataaagaac aagataaggc aacaggtgtt aaattaaaat ctaataaggc tattccaatt 180

gatttgggta ttggttatta tatttctgaa aatgtacgtg ctgatttaac tttaggaact 240 

acaataggtg gaaaactcaa gaaatacgga gcagcaacta atacacattt tactggtact 300

aacgtttcag tgagccataa gcctactgtt acacgtttgc ttattaacgg ttatgtagac 360

ttaacaagtt ttgatatgtt tgatgttttc gttggtggtg gtgtcggtcc tgcattagtg 420 

aaagagaaaa ttagtggggt aagcggtctt gcatctaaca ctaaaaataa aaccaatgta 480

tcatataagc tgattttcgg tacttctgcg caaattgcag atggtgttaa agtagagcta 540 

gcatatagct ggataaatga tggtaaaaca aaaactcata atgtaatgta caaaggggca 600 

agtgtgcaaa ccggtggtat gcgttatcaa agtcataacc tgacagtagg tgtaagattt 660 

ggtatataa 669 
<212> Type : DNA 
<211> Length : 669 

SequenceName : SEQ ID 717 

SequenceDescription : 

Sequence 



<213> OrganismName : Rickettsia prowazekii strain Madrid E 
<400> PreSequenceString : 

atgaagaaga atatgagaaa gcaaatgctt aaaatcatat caatcattat tatttctctt 60 

ttattaagca gttgctccga atctacgcgt gatgaaaatg gattacttac agatagtcaa 120

agtactataa ttcgagatta tataatatcg caaaattcta aaaatcttaa agtgaacctt 180

aaagaaaagt ttggttccaa tttaaaagga gtaaaattaa taggaataaa gttaacaaat 240

gaagatttat cgggaataga tttcacttca tgcgaaatat tacggactga cttcatgggt 300

agcaacttag aaaaagcaat acttacaaat tcggtaattc aagaaagtaa ttttgcggat 360 

tcagtaataa aaaatatttc aggctataat gctgattttc aaggttcaat ttttaataat 420

ataacattac aaaatacaaa ttttgttcaa tcaaatttca gtgatactgc ttttaataaa 480

agtactataa tcaatgtcaa ttttgaaaat tctaaattta gtaatgtatt atggtgtcac 540 

agtaatattg acagtagtaa ttttcaaaaa actcatctaa aaaataatag ctttaaaaat 600 

actaatgtaa tgaattcaat attttatggt gcagatttag gcaaaagtgt aataaataat 660 

acaaatttta ctaataatta ttttgaatct agtgacctaa gtaacactaa attcacatca 720

gtaatcatta aagattctaa cttcacacaa agtattttta attcagtaaa tttcaataat 780 

atacaaagta ataactcttt tttttcatat acttcctttg aagattcaac attacacaat 840 

attcacctta ctaaatgtga tttacaaaac agcacaatta atagttcagt tttcaataat 900 

tttaaaatcg acaatgctat attaacaaat atgagtctca acgataatac atttaataat 960 

ttatcaataa aaaatagtaa tactaatttt gtaaggatta ataaatccaa agggtttaat 1020

attactttac tcaatactaa ctatagtaat gctattttta gcaataatga tttaaaagaa 1080 

tttaaagtca ttaatactga tttaaacaac agtgaaataa taaactcaaa tttcactaat 1140 

ggacaattta ataatgtaaa tttttctcaa tctttaatac aaaacgtaaa ttttacagac 1200 

gtgaaaatta ctttaggcaa tttaaatcaa gtagctctaa taaattccaa tctaataaac 1260 
actaatatta ttaactcagt cctttctaat tcacaaataa ataatattaa ctaccaagca 1320
tattatagtt ttatcaatac taatgtttct aataatattg ttataaatga taattcgaat 1380
caaattccac caaataatat agtaatcaat tctgaaaaag atttacaaaa catatctaat 1440
ttagcaaata tgaatttaac aaattttaac ttaagtaatt tagtgtttaa tggagtagat 1500
ttttcaaaaa gcatctttaa aaaagctaat ttaacaaata cagtaataaa aaattctatt 1560
ttaaaagatg ctaatttttc tgcagcaata cttactaaaa cagattcctc aaaatcgata 1620
ttaacaggta gtatatttaa gtttgctcaa attgatcaga catgttttag taattccgac 1680 
ttaacaaata ctgattttac tgaagcaaca attaaaaata ctgcatttga taatgctaat 1740 
acacacggta taaaaggatt agaataa 1767 
<212> Type : DNA

<211> Length : 1767 

SequenceName : SEQ ID 718 
SequenceDescription : 

Sequence



<213> OrganismName : Porphyromonas gingivalis W83
<400> PreSequenceString :

gtgatacaaa aatttactaa tgtaaaacta aatgatatgc gaaaaatttt gagctttttg 60
atgatgtgct ctctgcattt aggtctacaa tctcagactt ggcatggaga tccggactca 120
gtggcagccc taccttctat cggtattcaa gagtcaagtt gtacccgaat cacgttcgag 180
gttgttttcc ccggatttta tagtgtggaa aaacgagaag gcaaccaagt ctttcagcgc 240
atttccatgc cgggttgtgg ctcgtttggg aatctgggcg aagctgaatt gcctgttttg 300
aaaaagatga tagccgttcc ggaattttca acagctaacg ttgctgtaaa aatcaaagag 360
acggagacat tcgacaatta taatatctat cctaatccta cctatgtcgt agaggagttg 420
cctgaggggg ggacttatct ggtagaggct ttcgcgataa acaatgacta ttatagccaa 480
aatgtaagcc tcccttctac tcactatgtc tattctcaag acgggtattt tcgctcacaa 540
agatttatcg aagttaccct gtatcctttt cgatacaacc ctgtccgaca agaaattcta 600
tttgcaaaaa aaatcgaggt tacaataact ttcgataatc ctcagccacc tttacaaaaa 660
aacaccggca tatttaacaa agtagcctcc tctgcattta ttaattatga agctgatggc 720
aaatcggcga tagaaaatga tatggtgttc agtcgtggta caacaacgta cataagcgga 780
aatgttgcca gcaacctccc tcagaactgt gactacttgg ttatttacga tgatatgttc 840
aacgtaaatc aacaaccaca cgacgaaatc aaacggctgt gcgaacatag agccttctac 900
aacggctttg atgtagctgc tgtaagtata aaggacgtat tgaatagctt cccatcaaat 960
gccacctcat acatcaacga aactaaactg aaaaatttca ttcgctcagt ttacaaccaa 1020
agcaatgcga agaggacttt agatggcaaa ctgggatacg tgctactgat cggaaaacca 1080
ttgagcaaat atttggctga cactgataat acaaaagtcc caacctcttt tattcataat 1140
gtctccttaa ttccaagtca tccaactttt ggttccatat gcgcctccga ctattttttt 1200
agttgtgttt cgccccttga tactgtcggc gatttgttta tcggtcgatt tagcgtcacc 1260
aatgctcatg aattgcacaa tctgattgaa aagactatca acaaagaaat ctcatataat 1320
cctattgcac acaaaaatat tctttacgca gaagggaaag gctgcgatgc tccaatctta 1380
cgtttattct taaaagaaat cgcctctggt tacacagtca actctatctt aaaatctaat 1440
caggtctctg caatagactc gatatttgac tgcttgaata atggttccca tcatttttat 1500
tttaacactc atggaatgcc gactgtttgg gggatagggc agggactcga cgtcaatact 1560
ctaacagccc gattgaacaa tacatcttcg cagggattat gtacgagtct atcatgtagt 1620
tcggctgtag cagattcaac tattagatcg cttggagaag tcctgaccac atacgcacct 1680
aacaagggat tctcggcttt cttaggagga agcagagcca cccaatatgc cgtttattta 1740
gaaggcccct gtcctccgtc agaattttat gaatatttac cttattcttt atatcacaat 1800
ctctcgactg ttgttggcga aatgttgcta tcatccatta tcaatactaa ttctgttgat 1860
acgtattcga aattcaactt caatttgctt ggcgaccctg cactaaacat tatggctcat 1920
ggcatggagg ttagtaattg tattacacta ccaaacaaca ccattataag cagtccgata 1980
acaataaaaa atggtggctg cctaaaaata ccggaaaaag gagttttgca ttttactaat 2040
aatggctcca tacaagtcat gtccggagga actctggaaa taggcaatca ggctaaaata 2100
tccggagaga ccggtgctaa ccccaccttt attaccgttt acggcgatgg tcttgcgatt 2160
aacaagcagg tagagataga caatatagac cgacttaact tgttttctac gcattcggtc 2220
atgcccaaat ttcattttga cagtgtgaaa ttcaacagtg ccccgctgta tacaacgaac 2280
tgtattgtgg agataagcaa ttgcgaattt accaatcgaa gtgacattat ttcaaagaat 2340
tgtgacctaa gcgttgaaaa cagtatgttt agcagttcgg ggataacggt attcaagcct 2400
atggctacaa gctccatcac cggattatct acaaaagcaa agattaccga caatactttt 2460
tttgcgacag gaaacttcgc ctaccatatc acaaacacgc caggcttaac agcaacctcc 2520
aatgctgcca tcaagttaga caatattcct gagtattaca tttccggtaa taaaatagtc 2580
aattgcgatg aggctcttgt actaaataat agtggcaaca gaacgaacag actccacaat 2640
atcacacgga atgtgataaa aaactgtagg attgggagca cgctttataa ttcctatggt 2700
atttacaacc gaaataagat cagtaacaat catataggag tacgtctcct caacaacagt 2760
tgtttttatt tcgataatgc tcctgtaatc aatgaagaag ataagcagac gtttatttct 2820
aataggactt ggcagctcta ttcatcaaac ggtacattcc ctctcaactt ccattacaac 2880
agcttgcagg ggggagatac agatacatgg atttacaacg acacgtatac gaatcgctat 2940
attgacgttt caaataatca ctggggcaac aatgatttgt ttgatccgaa tcaggttttc 3000

aatacgccag acttgttcat ttggatacct ttttgggatg gattgccaaa tgggagatcg 3060 

ggcaatagct ctgctgaagc agtagaattc caaacagcat tggactgtat tggcaatagc 3120 

gattatcttt cggcaaaagt ggctctcaag atgatggttg aaacctaccc ggaatccgac 3180

tttgcaatag ctgctttgaa ggaattgttc aggatagaga aaatgtcagg caacgattac 3240

gaaggcttga aagattattt cagatccaat ccaaccatca tctcttccca gaacttgttc 3300

ccgacagctg atttcctgtc tgcgcgatgc gatattgtgt gtgaaaacta tcagtctgcc 3360 

atcgattggt acgaaaatcg cttgaatagt gaaatctcct atcaggacag tgtttttgca 3420 

gtcattgacc ttggtgacat ttattggaat atgcagttag actcactcag agggactggt 3480 

atagatttga acatactttc ctgtgaacaa aggaaatcgc tcgaaagcca tcaaaatgta 3540

aaaaattatt tgttgtcaac tcttcccgaa tcaacaggta ctctcctgcc tccattagaa 3600

tgcaacaaat caagccttga taaatccaag ataatctcta tttcgcccaa tccggcgaaa 3660 

gctgttgtaa caataatcta ctataccgat aacccttcct gttctgtaat aaaaatatat 3720 

ggaataaatg gagcctcggc tgatataacc gggttgccca aagatctatc cgaaggttat 3780

tacagcatac agttcaatac atccaacttt gatcccggtt tctacctggt aacgctaaat 3840

gttgatcaga aaattataga tacggaaaaa ttacgaatca aataa 3885
<212> Type : DNA 
<211> Length : 3885 

SequenceName : SEQ ID 719 

SequenceDescription :

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

<400> PreSequenceString :

ttgcagggta aaaacacgat tgtcacaacg ggggattact caattggttt gctaagtcaa 60 

acgagtggaa acctgaatac tgacaccata ataagagtca actctgacgg ttcggttacc 120 

ccctcttttt ctgacggaga tgatacattt attgttactg cgggtaatca cgcagtcggc 180 

gttcttgcgt gtgcatctcc cggaagcgcg tgtgcgtgtg tatcttctct tgatgaagaa 240 

agtactgccg atacaggaag taatgaaaat aatgccatag caaaactgga tatggcaaaa 300

ggggagatca caacccacgg aacagaaagt tatgccgcgt acgctaatgg cactgtcgtc 360

aaggctggtg acacgcttga ttataccaat gccagcgtta cattaaccga tgttgatatc 420 

accacgcatg gggacaatgc tcatgccatc gctgcccgcc aggggacagt ttcatttaac 480

cagggagaaa tttatacaac gggtcctgac gccgcgattg ctaaaattta taatggcggc 540 

acggtgacgc tgaaaaatac atctgcagtc gcgcatcagg ggtctggaat tgtgctggag 600

tcttccataa atggtcagga agcaacggta gatattttat ctggtagctc actacggtca 660 

gcaaatgaaa tcctctacca taaagatgag acgagtaacg tgaccattac ggatagtgag 720 

gtatcatcgg ctgcagatgt ttttatcaat aatattaaag gtcatttgac cgtcgatgca 780 

actaattcaa aaataacggg ttcagctaat atttcaacag atgataacac tcatacctat 840 

ctgtcgcttt cagataatag tacctgggat attaaagctg actcaacggt gagcaacctc 900

acggttgata atagtacggt ctatatttcc cgggcagatg gacgggatgt cgaaccaaca 960 

agattaacaa taactgaaaa ttacgttggt aataatggcg tattgcatct cagaactgaa 1020 

ttggatgatg ataattcagc tacggataaa gtcgttatta atggaaatac ctctggaaca 1080

acccgagtta aagttaccaa tgcaggaggc agcggagctt acacgttaaa tgggatagag 1140 

attatcagcg ttgaggggga atcaaatggg gaatttatta aggattcgag gattttcgcc 1200

ggtgcctacg aatattcatt aacccgaggt aataccgaag cgaccaataa aaactggtat 1260 

ctgactaact tccaggcaac gagcggcggt gaaacaaact ccggaggaag ttcagcgcct 1320

actgttgcgc ctacccccgt cctgcgcccc gaagctggaa gttacgtcgc caacctggca 1380

gccgctaaca ctctttttgt tatgcgtctg aacgaccgtg cgggtgaaac gcgctacatc 1440 

gatcctgtaa ctgaacagga gcgttcaagc cgactttggc tacgtcaaat tggcgggcat 1500

aatgcctggc gtgacagcaa cggacagttg agaacgacct cgcatcgcta cgtctcgcag 1560

ttagggggcg atctgttaac cggtggtttt accgatagtg acagttggcg tttgggagtg 1620 

atggctggtt atgcccgcga ctacaactta actcattcca gcgtgtcgga ttatcgttcg 1680 

aaagggagtg tcagaggcta tagcgcaggg ctgtatgcca cttggtttgc cgatgacatc 1740 

agtaaaaaag gcgcatacat tgactcctgg gcgcaatata gctggtttaa aaactccgtg 1800

aaaggggatg aattagccta tgaatcctat agcgcgaaag gtgcaaccgt ctcgctggaa 1860

gcgggttacg gttttgccct gaataaatcc tttggtctgg aagcggcgaa atatacgtgg 1920 

atcttccagc cacaggcaca ggctatctgg atgggcgtcg atcataatgc gcacacggaa 1980 

gccaatggct cacgtattga gaatgacgca aataacaaca tccagacccg actcggcttc 2040

cgcaccttta ttcgtactca ggagaaaaac agcggtccgc acggtgacga ctttgaacct 2100

tttgttgaaa tgaactggat ccataacagt aaagattttg ctgtctcaat gaacggtgtg 2160 

aaagtcgaac aagatggggt gagtaatttg ggggaaatta aacttggcgt aaatggcaac 2220 

ctgaatccag cggccagcgt ctgggggaat gtgggcgtgc agctgggtga taatggctac 2280 

aatgacaccg cagtgatggt gggcctgaaa tataagttct ga 2322 

<212> Type : DNA

<211> Length : 2322 

SequenceName : SEQ ID 720 
SequenceDescription :
Sequence 

<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString : 

gtgaccaaac tcaaacttct ggcacttgga gtgcttatcg caacgtctgc aggcgtagcg 60 
cacgctgaag gtaaattttc cctgggcgca ggcgtaggtg tcgttgagca cccatataaa 120 
gattacgata ccgatgttta cccagtaccg gtaatcaact atgaaggcga taacttctgg 180

ttccgtggct taggtggtgg ttactacctg tggaatgacg caacggataa actttcaatt 240
accgcttact ggtcgccgct ttacttcaaa gcgaaagaca gtggcgatca ccaaatgcgt 300

cacctggatg accgtaagag caccatgatg gctggtctgt cttatgctca ctttacccag 360

tacggttacc tgcgtaccac cctggctggc gataccctgg ataacagcaa cggcattgtc 420

tgggatatgg cctggttgta tcgttacacc aacggtggcc tgaccgtgac tccgggtatt 480

ggtgtgcagt ggaacagcga aaaccagaac gaatactatt atggcgtatc gcgcaaagag 540
tccgctcgca gcggtctgcg tggctataac ccgaatgaca gctggagccc ttaccttgag 600 
ctgagcgcca gctacaactt cctcggcgac tggagtgttt acggtaccgc gcgctacacc 660 
cgtctgtctg atgaagttac tgacagcccg atggtggata aatcctggac tggcctgatt 720 
tctaccggga tcacctacaa attctga 747 

<212> Type : DNA

<211> Length : 747 

SequenceName : SEQ ID 721 
SequenceDescription : 

Sequence



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgaaaaaaa ttgcgctagc aggtctggcc ggaatgctgt tggtttctgc atcggtcaat 60 
gcaatgagca tcagcggcca ggcgggtaaa gaatacacca atattggtgt cggttttggg 120
actgaatcga cgggcctggc tttaagcggt aactggacac ataacgacga cgacggtgac 180
gtcgcgggcg tggggctggg gttgaatctg cctctcgggc cgttaatggc gaccgttggc 240
ggaaaaggcg tgtacaccaa cccgaattac ggcgatgaag gttatgccgc agcggtagga 300
ggtggtttgc agtggaaaat tggcaacagc ttccgtttgt ttggcgagta ttactactct 360
ccggattcgc tctccagcgg tattcaaagt tatgaggaag cgaatgctgg cgcgcgttac 420
accattatgc gtccagtcag tattgaggcg ggttatcgct acctgaatct gtcgggtaaa 480

gacggtaacc gcgacaacgc tgtggctgac ggcctgtatg ttggggttaa cgccagtttc 540 
tga 543

<212> Type : DNA 
<211> Length : 543

SequenceName : SEQ ID 722 
SequenceDescription : 



Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString :

atgactacct tgaccgccag ggtgtttaca acggcagaga taatttatcg gaaaacagtt 60 

atcgcattgg tgtgtcattt aaactgtagt agacaggaga cagtcacaat gaataaaact 120

ataatggcgc ttgctatcat gatggcttca tttgccgcaa acgcgtctgt attaccggaa 180

actcctgtgc catttaaaag tggtaccgga gcaattgata acgacactgt ctacattggt 240 

ttaggtagcg caggtacagc atggtacaag ctagatacac aggccaaaga taaaaaatgg 300

actgcgttag ctgcattccc tggtggaccg agagagcaag caacctctgc atttattgat 360 

ggcaatctgt atgtgtttgg cggcattggc aaaaacagcg agggattgac tcaggtattt 420 

aatgacgtac acaaatacaa ccccaaaacc aatagctggg ttaaattgat gtcgcacgcg 480

ccgatgggca tggcgggtca tgtgactttt gtacacaacg gcaaggctta tgttactggc 540 

ggtgttaacc agaatatctt caatggctat tttgaagatc tcaacgaggc tggaaaagat 600

tcaaccgcta tagataaaat caacgcccac tattttgaca aaaaagcaga agattatttc 660 

ttcaataagt ttctgttgtc ttttgatccc tcaacacagc aatggagtta cgctggcgaa 720 

tcgccctggt acggaacggc tggtgcggcg gttgtgaata aaggtgataa aacctggctt 780

attaatggcg aagccaaacc aggattgcga acggatgccg tatttgaact tgatttcacc 840 

ggtaataatt taaaatggaa taagcttgat cccgtctcat caccagatgg cgtcgctggc 900

ggttttgcgg ggataagcaa tgattctctt atatttgctg gaggggccgg attcaaaggt 960 

tcacgagaaa attaccagaa cggtaagaac tatgcgcatg aaggcctgaa aaaatcatat 1020

agcactgata ttcatctttg gcataacggg aaatgggata aatcgggtga attatcgcaa 1080

ggtcgggcct acggagtatc attgccctgg aataatagtt tattaattat tggcggtgaa 1140 

actgcaggcg gcaaagcggt gacggattca gttttaattt ctgtgaagga taataaagtc 1200
acagtacaaa actaa 1215
<212> Type : DNA 
<211> Length : 1215 

SequenceName : SEQ ID 723 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

<400> PreSequenceString :

atggcaactg gcggcgctgc cctggcaggt aaagcggtta tgggtgccgc agccggtgca 60 

gccggaggtg caagtgcact ccaagcggct tttcagaaag catcagcgag tatggaaacc 120 

ggcggtgaca tgtccagcat ggggtcagtt gtcagcagtg gcggaaacgg tggtggtgaa 180 

gcgggtactg ctggcagtag cccattcgcc caagcggctg gctttggtga cagcggcagt 240

agctcaagcg gtggcggctt tgccaaggcc gcgaagctgg ccacaggcac ggcctccgag 300

ttagccaagg gtgtcggctc tcaagtgaag cagggattcc aggagcgagt gagcgaaacc 360

acaggcggaa aactggctgc ttcgatacgc gaaagcatgg agccgaaaga agcaagccaa 420 

tctggccagt tcgagggcaa tagcttgggc gccgattctg gcccagatag taacgaagtc 480

aggagttag 489

<212> Type : DNA

<211> Length : 489 

SequenceName : SEQ ID 724 
SequenceDescription : 

Sequence



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString :

atgaagcgag ttcttattcc tggcgtcatt ttatgtggcg ctgatgtggc gcaggccgtc 60 

gatgacaaaa acatgtacat gtattttttt gaagagatga cggtctatgc tcctgtccct 120

gtacccgtaa acggcaacac gcattacacc agtgaaagca tcgagcgttt accgaccggg 180 

aatggcaata tcagcgatct gctgagaacc aaccctgcgg tacgcatgga ttcaacgcaa 240 

agtacctcgt tgaaccaggg agatattcgc cctgagaaaa tctctattca cggtgcgtcg 300 

ccctaccaga atgcctattt gattgacggt attagtgcaa ctaataacct gaacccagcg 360 

aatgagtccg atgccagtag tgcaaccaat attagcggga tgtcacaggg gtattatctt 420

gatgtcagct tactggacaa tgtgacgctt tatgacagtt ttgtgccggt tgaatttggt 480 

cgcttcaatg gcggggtaat tgatgcaaag atcaaacgct tcaacgctga tgatagcaag 540 

gtgaaattgg gttatcgcac tacgcgtttg gactggttaa catcgcatat cgatgagaat 600 

aacaagagcg catttaatca aggttcttca ggaagtacct atttctctcc agattttaaa 660 

aagaactttt ataccttgtc gtttaatcag gagctggctg ataactttgg cgttaccgcc 720

ggtttatcgc gccgccagtc tgatatcacc cgcgcggatt atgtttcgaa tgacggcatt 780 

gtcgccggtc gggcacagta taaaaacgtt atcgatactg cattgagcaa atttacctgg 840

tttgccagcg accgctttac ccacgattta accttaaaat ataccggctc cagccgtgat 900 

tataatacca gcaccttccc gcagtctgat cgcgaaatgg gtaataaatc ctatggtctg 960 

gcatgggata tggatacgca actcgcatgg gccaaactac gtaccaccgt tggttgggat 1020

catattagtg attatacccg tcacgatcat gacatctggt acaccgaact ttcatgtaca 1080 

tatggtgata ttacagggcg ttgtacccgt ggcggattag gacacatttc ccaggctgta 1140 

gataattaca ccttcaaaac acgcctggac tggcaaaaat tcgccgtggg tgatgtttcg 1200

catcaaccct acttcggcgc ggaatacatc tattccgatg catggactga acgccataac 1260

cagtctgaat cctatgtgat taatgctgca ggaaagaaaa ctaaccatac catttaccat 1320

aaaggtaaag gcagcctggg aattgacaac tacacgctgt atatggcgga tcacattagc 1380

tggcggaatg tgtcgttaat gcccggtgtg cgttatgact atgacaacta tctgtcaaac 1440 

cacaatatct ccccgcgctt tatgacggaa tgggatattt ttgctgatca aacctcaatg 1500 

attaccgccg gttataaccg ttactatggc gggaatattc ttgatatggg attacgtgat 1560 

atccgcaata gctggacgga atcggtatca ggtaataaaa ccctgacgcg ttatcagaat 1620

ttgaaaacgc cttataacga tgaactggca atgggattgc agcaaaaaat cgataagaac 1680 

gttattgcac gcgcaagtga agcgcatgat caaatcagca aaagcagtcg taccgacagc 1740 

gcgactaaaa ccaccattac tgaatataac aacgatggca aaaccaaaac gcattcgttt 1800

aacctcagtt ttgaactggc cgaacccctg catatccgcc aggtagatat taacccgcaa 1860

attgtcttta gctatatcaa gagcaagggc aacttgtcgt taaacaatgg ttatgaggag 1920

agcaataccg gtgataacca ggtggtttat aacggtaatc tggtctctta cgatagcgtt 1980 

ccagtggcag attttaataa cccattaaag atctccttaa acatggattt cacgcatcaa 2040 

ccgagcgggt tagtgtgggc gaatacgctg gcctggcaag aagcgcgtaa agctcgcatt 2100 

atccttggta agacaaatgc gcaatacatc agcgaatatt cagattacaa gcagtatgtt 2160 

gacgaaaaac tggatagcag cctgacctgg gacacccgct tgtcctggac gccacaattt 2220

ctgaaacaac aaaacctgac gatcagtgcc gatattctca atgtactgga tagcaaaacc 2280

gctgttgata caacgaacac cggtgtggcg acctacgcca gtggccgtac tttctggctt 2340 
gatgtcagca tgaaatttta a 2361
<212> Type : DNA 
<211> Length : 2361 

SequenceName : SEQ ID 725
SequenceDescription :

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString :

atgaagaaaa cattgttagc aatcatgctg gcggggacag cttttgcttc tcaggcggga 60
actttagtaa gccagggtac tgaagcgtca gctaatctta ctttgaccaa gcctatcgtt 120
gttaataata ccattcaacc agtgaaaggg gtatatagcg gtacgttaac tgcatggacg 180
cctttggcaa caggtattgt tggcgcatct gatgggcaga gccacgacta tgctgtaact 240
ttcccggatg atatttatgc ggagagcagt acttcagcgg atgcagtaat ttctggtgac 300
aataacccgg accataaact gaaagtttcc ctgacaacgc ttgagcagga tcctccttca 360
gctgcctccg aagagattgg cggtaagcgt tatatgatgc tgaaaaatac tggaacaggt 420
ggtgcctatc gcgtcgtttc ccatatgaaa gaacaggttg ttgagccaga ctcttacacc 480
atccgtaccc aggcttatat ctacgcagaa taa 513
<212> Type : DNA

<211> Length : 513 

SequenceName : SEQ ID 726 
SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgggtattt accactggtc ccggaagaca aaaatgaaac ggacaaaatc catacgccac 60
gcatcgttcc gcaaaaactg gagcgcacgc catctgacac cagtcgctct cgcggttgcc 120
actgttttta tgctggctgg ctgtgaaaag agtgatgaaa cagtgtctct ctatcaaaat 180
gctgacgact gttcagctgc aaacccaggc aaaagcgccg aatgtaccac cgcgtacaac 240
aatgcgctga aagaagccga acgtactgcg ccgaaatacg ccacccgtga agactgtgtt 300
gctgaatttg gtgaaggtca gtgtcagcaa gcaccagccc aggctggcat ggcaccagaa 360
aaccaggcgc aggctcagca atccagcggg agtttctgga tgccgctgat ggccggttac 420
atgatggggc gtctgatggg cggcggcgcg ggatttgcac agcagccgct gttctcctcg 480
aaaaacccgg ccagtccggc ttacggtaaa tataccgacg cgacgggtaa aaactatggc 540
gcagcccagc caggccgcac catgaccgta ccgaagacgg caatggcacc aaaaccggcg 600
accaccacta ccgttacccg tggcggtttt ggtgaatctg ttgccaaaca aagcactatg 660
cagcgtagcg caaccggcac ctcttctcgt tcaatgggtg gctga 705
<212> Type : DNA
<211> Length : 705 

SequenceName : SEQ ID 727 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString :

atgacaaaaa tgagtcgcta cgccttgatt accgcgctgg cgatgtttct cgccgggtgt 60 

gtggggcaac gtgaacctgc accggtagaa gaagtgaaac cagcgccgga acaaccagcc 120 

gagccacaac agcctgtccc cacagtgccc tcggtgccga cgatcccgca gcagccaggc 180 

ccaattgagc acgaagatcg aacggcaccg cctgcgccgc atattcgcca ttatgactgg 240 

aatggcgcaa tgcagccgat ggtcagtaag atgcttgggg ctgacggggt gactgcgggt 300 

agcgtcctgc tggttgatag cgttaacaac cgtactaacg gttcgctgaa tgccgcagaa 360

gcgaccgaaa cgctgcgaaa tgcgctggct aataacggga aatttaccct ggtttccgcc 420 

cagcaactgt cgatggctaa gcaacagtta ggtttgtcgc cgcaggacag tttaggcacc 480 

cgtagtaaag ccataggcat tgcccgcaat gtcggcgctc attacgtgct gtactcctgc 540 

gcctctggca acgttaacgc tccgacccta caaatgcagc tgatgctggt gcagacgggc 600 

gaaattatct ggtcaggtaa aggtgccgtt tcgcagcaat aa 642 
<212> Type : DNA 
<211> Length : 642 

SequenceName : SEQ ID 728 

SequenceDescription : 






Sequence 
<213> OrganismName : Shigella flexneri 2a str. 2457T
<400> PreSequenceString : 

atgacgaaac ttatgcaatt tgttcagagg tgctattata tgactaataa gaaaatgtat 60 

tttatattaa tacttgtttt cacattgcta caggtgtgtt tttttgctct ctggaaggct 120

cgtgatggta gcacaacgtc tcttgaatgt acgtcgacgt taacaagaaa tgctaaaaca 180

gatcattctt tatactactc tgctaacctt tcagtcattt taaaaaaaga tggtagcggt 240 

agttttacga ttgttggctt aactgatgaa gatacaccga ggaaattttc ccattcgtat 300

ttttttacct acaaaataga tagtaacgga cggatctcag gtaatgctaa ggccaaagtt 360 

tcgggactgg aaaatcagat aaaggatgag aacttcagac tcaattttct cgatgcatct 420

ttaacaggta aaggcaatgc gaggctaagc aagtttaata atgtctatat ttttagtatc 480

ccggggttga tcattaacac atgtgctcca atataa 516
<212> Type : DNA 
<211> Length : 516 

SequenceName : SEQ ID 729

SequenceDescription :

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

<400> PreSequenceString :

atgggcagga taagctcggg aggaatgatg tttaaggcaa taacgacagt cgccgcactg 60

gtcatcgcca ccagtgcaat ggcgcaggat gatttaacca ttagcagcct tgcaaagggc 120

gaaaccacca aagctgcgtt taatcagatg gtgcaagggc ataagctacc tgcatgggtg 180

atgaaaggcg gtacttatac tcccgcacaa accgtaacgt tgggagatga gacgtatcag 240

gtgatgagcg cgtgcaaacc gcatgactgt ggctcgcaac gtatcgctgt gatgtggtcc 300

gagaaatcta atcagatgac ggggctgttc tcggctattg atgagaaaac gtcgcaagag 360

aaactcacct ggctgaatgt gaacgatgcg ctttcgattg atggtaaaac ggtgctgttc 420

gcggcgttga ccggcagcct ggaaaaccat ccggatggct ttaattttaa ataa 474

<212> Type : DNA

<211> Length : 474 

SequenceName : SEQ ID 730
SequenceDescription : 

Sequence



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 
atgaaaaaac aatttttgga aaaagctgtg tttactgttg cggctacggc agcaacagtt 60 

gttttaggaa ataaaatggc tgatgcagac acttatactc ttcaagaagg agattctttt 120
ttcagtgttg ctcaacgata tcatatggat gcttatgagt tagcttctat gaatggaaaa 180 
gatattacca gtctgatttt gccgggtcag actttaactg ttaatggttc ggcagcaccg 240 
gataatcagg cggcagcgcc aactgacact acgcaagcaa ccactgaaac gaatgatgcg 300 
aatgccaata cttatcctgt tggtcaatgt acttgggggg ttaaagctgt tgcaacttgg 360 

gcaggcgact ggtggggcaa tggcggtgat tgggcctcta gtgcttctgc acaaggttac 420

actgtcggta acactccggc agtagggtct attatgtgtt ggacagatgg tggttatgga 480

catgttgcct atgtcacagc tgttggtgaa gatggtaaag ttcaagtact ggaatccaat 540 
tataaagatc aacaatgggt tgataactat cgtggttggt ttgatccaaa taatagtgga 600

acaccaggca gtgtcagtta tatttatcct aactaa 636

<212> Type : DNA

<211> Length : 636

SequenceName : SEQ ID 731 
SequenceDescription : 

Sequence



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 

atgtctatta aaaatatttt agaaaacaaa acaacaacaa ttaaagttag ttttgcagga 60 

attgcaacag cagctagttt aattttacct atggcagtac aggcagaaac tacttatact 120

gtgaaatcag gagatacttt atcagagatt gcttcaacac acggaacaac tgttgataaa 180 

cttgctaagt taaataaaat taataatatc catcttatcc atgctggtca aattttagaa 240 

ttagatgcag caacagaaga tactgatgca acgccagtac aagaaagtca gataaatgaa 300

gcagaaacct cagcatctgc caaaactagt caaacgagcg aagtgacgac aacagcaccg 360

gtacaagaaa gccaaacaag tgaagttata acttcagcac cagctgaaac cagtcagaca 420

agcgaagtgc caactgaagc caaccaaaca aatgaagtaa gttcagcagt atcggttgaa 480 

accagtcaaa cgagtgaggc gacgacttca gctccagtgg aaactagtca gacaagcgaa 540 



gcaacgacag cggaaccaac tgagaccaag accagccaaa caaatgaagt agcagcttca 600

gctgaagaaa accaaacaac atctaatact agcggtttga gcacatctga tgcagctgca 660 
aaagaattca tcgctcaaaa agaatcaggt ggtaattata atgctaaaaa tggtcaatat 720

tatggacgtt atcaattgag tgattcttac ttgaatggtg atttgtcaga agaaaatcaa 780 
gaacgtgtag cagatgctta tgtatcaagt cgttatggtt catggactgc tgctcaagct 840

ttctggaatg ctaatggttg gtattaa 867 
<212> Type : DNA 
<211> Length : 867 

SequenceName : SEQ ID 732 
SequenceDescription :

Sequence 



<213> OrganismName : Streptococcus mutans UA159

<400> PreSequenceString :

atgaaatgtc aagcgtttga agattttaaa gctacaagtt tgaataagct gtcttatacg 60 

acaggtggag ctactgatgg tgaaattata gctaatcgga tgttacaagg taaagctact 120

aagggcgaaa ttactatgta tacttggaac attattcaaa atggctgggt gaattcactc 180 

gtgtcttggg gtattggtgg ttataatagt tctataggat actcagctca aggtaataga 240 

ggatttagta attatccata cgatgtttct atggattcag ataatagcag tagttcaagt 300

aatacgacag gtggttatgt taattataac cagagtttta attctggatg gtaa 354



<212> Type : DNA 
<211> Length : 354 
SequenceName : SEQ ID 733

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString : 

atgcgctatt cacaaatttg tcgtaaaagt ttggctttgc tggctacagg aatgatttta 60 

actacctcaa ccttacctag tataagtatt ctagctgagg acagtactgg agcacctgct 120

aggccagatg gacaagctcc tgctggaggt ggtgctaaca ccacaactta tgattacagc 180 

ggaatcaaca gtggtgttct agttgctaat ggtagcaagg tcacttccag ttccaaaacc 240

aaatcaacta cttccgccca aaatacagct cttgttcaaa acggtggtag tttaacactg 300

cataaagcga atttaataaa atccggtgat gataacaatg gtgacaatga taatttttac 360 

ggtattaatt ctattttact agcagtcaat gaaaggtcaa aagcttatgt ttcgaattcc 420 

aaactaaaag ctagtagctc tggcagtaat gggatttttg caactgataa ggcaaccatc 480 

tacgctaata aaacaagcat tgcgactaca gctgataatt cacggggact tgatgccact 540

tacaatggca atattattgc taataagatg gccatttcta caaaaggtgc tcacagcgct 600

gctattgcaa ctgaccgtgg tggcggcaat atttccacca ctaattccag tttaaatact 660 

agtggctctg gctcacctct tctttattca acaggcaata ttcaagttaa tcacgttaca 720

ggaacatcta gtaacagcca aattgctggt atggaaggtc ttaataccat tcttattcat 780 

aattctaatt taattagtac catgacaaac aaaactgcca gtgacccgat tgccaatggc 840

gttatcatct atcagtcaca atccggtgat gccgaagcaa caacggggca aagtgcccac 900 

ttcgagctca gcaagtctaa attaacttct tccattactt caggttctat gttctacctg 960 

acgaatacct ctgcaaacat tatccttaat caatccacct tgaattttga tgcaaataag 1020

gctaaacttt tgactgtagc aggcaatagt gccaataatt ggggaacccc cggtagtaat 1080

ggggcaacag ttaactttac tggccataag cagacactta aaggggatgt tgatgtggat 1140

agtatttcga ccttaaatat gtacctgctt gataaaacca actacactgg caaaactgct 1200 

gtatcaacca acagtaccaa tatatcccca agcacgtctc ctattaccat gaatatttct 1260 

aaaaattcca aatgggtgct aactggtcat tcgacagtaa ccaatctcaa tgctgaaaaa 1320 

ggtgctaaaa ttgttgataa agacggaaaa accgtcagcg tcatctcttc aagcggacaa 1380

aaacttgtta aaggtaaaag caaatatagc ctaacagtca caggaactta cagtcaaaag 1440

gtaacaacca gctcaagtaa caaacccagc agcagttaca ttaaccgtag tgacttcgat 1500

aattatttta aaacaacaac agcctttgta aataatacca aaaatacaag taattaa 1557 



<212> Type : DNA 
<211> Length : 1557

SequenceName : SEQ ID 734 
SequenceDescription : 

Sequence 

<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 
atgaacaaga taggtgatac tctacgtgat gctcgtattg aaaaaaaatt gagttttgat 60

gatgttgtag ataaaactgg aattgctcct cactatatac tggcaatgga gttagatcaa 120

cttaaattat taccagaagg caaaacaaat gagtatttag aaaaatatgc tcatgctgtt 180

ggcttagatc cggtttctat tattcatggt tatcgcaatc aggaaatgag cgatgaactt 240 

atcctacctt cttctgcaga attggctgct tcttcagata gcaatataga aaagaaaaat 300

gaaggaaaat caattgaaga acctcaagag ttagctattg atagtttgga tgttactcaa 360

aatattacag aagagactcc ccaaatagaa gattttaagg ttgaatcaga ggaggcaagt 420 

aaaaaaatag aaaaaatacc atctcgactt agtaaatatg attatgatga agaaccgaaa 480 

aagaaatttc catgggcttt aattctgcta attttacttg ctttaactat tatcagttat 540 

gttggatatg tggtttataa tcagctgcaa actgattcaa ataagactga attaagtacc 600

tctacgaaaa aatctaagga tactaaaaat gatgccaatt caacgacaca aagtcagacc 660 

agcataacaa cagactttgc tgatggtgga aataatatca ccttaagtaa tactaatgga 720 

aaagtagagg ttacctttac tttgacaggg gatgaggaaa gttgggtttc agcaactaac 780

actactgatg gagaatctgg aacaactcta acagcaacag ataaaactta tactgttact 840

ttagcagaag gttcaacaac atctatgctg acagtcggat cacctagcgg tgttgaaatt 900

acgatcaatg gtcaaaaggt ggatactaca aaccttgtta atgctggttt gactaatatt 960 

aatttgacag tccaataa 978 
<212> Type : DNA 
<211> Length : 978 

SequenceName : SEQ ID 735

SequenceDescription : 



Sequence 



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString : 

atgaagagca gaaaacgtca gcgaaaaggg ctggttcgaa aaaatgaaat cattatttta 60 

actctttttg tcgcatcagc cgtttcactc ttagcattta ccaattcttt tggggtactt 120 

gctaagagtc ttcatcttga aaaaatcaat aaatcgataa ctatttcgct tccttttggt 180 

aaaaaaaaga tggaacaaac agcacgttac tattcaggtg aacaggttca gatcagttcc 240

tcagctaaaa aagacagtct tggaaaaggg ctgtctcatt accaaaattg gattgggaca 300

gttaaaaaga taaaatcaca aaaagacagt cgtcaaaagc atcattatag ttatgaagtt 360

acatttgata atggcaaagc tttgaaatat gttcaggaaa aagacttagt taaaacaaaa 420

agatctaaat acagcaaagg tcaaattgtc aaattaaaat cctctgcgao agctgattta 480 

gatggcagca gtttaacgga ttatcgtgca tcagctggta aaatcgatca tatttcttat 540
aatcatagca ataccacagg tggctataag tatgatatca cctttgatga aggcggcaag 600

gtgactaata ttcaagaaaa agatttggat aaggtttatg aagttcagct aaaatcagaa 660 

aatactgcag ctcaaaataa tgagattctt aaacaagctt ttgcatatgc taaacaacat 720 

tctggtacta tcttaagttt gccaaacggt gaatttaaga ttggcagtca gacaccagac 780 

aaggattata tcactttaac atctgatact gaaattcgtg gagataatac gacacttttg 840

gttgaaggat cggcttattg gtttgctttt gcaactggga catcagctag tgatggtgta 900 

aagaatttca ccatgcgcaa tatcaatata aaagccagtg atttggaaaa aggaaatcag 960 

tttatgatta tggctgatca tggtgataat tggaaaattt gtaacaatag ttttaccatg 1020 

gtgcataaaa aaggaagcca catttttgac ttaggttcct tgcaaaattc agcttttgaa 1080

ggaaatcaat ttactggcta tgcccctgaa ttaactaatg tcagtaagat tgatgataat 1140

gcagatcttc atgattttta ttcagaagtt atccagctgg atgccgctga gtctagcggt 1200

gtttgggatg gtgggttgat aaaggctatt gatccaaatt atgaaaatta taataaagag 1260

aaacagttgt gcaataacat cacgattgct aataactctt ttgttcctta tatagacagt 1320 

catggtaaaa taatagcata cagcggaact atcggacagc actcttctga tgtcggcctt 1380

gtcaaaattt atgataatgt ttttagtaac tctttggtca gccgtttcaa tcaaaatggc 1440

aaaagcgagg cgtggatttt taaagctatt cacctaaaat caaattataa taatgctgtt 1500

tatgccaatt ctatcagtta a 1521 
<212> Type : DNA 
<211> Length : 1521 

SequenceName : SEQ ID 736

SequenceDescription : 



Sequence 

<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString :

atgagaaaac ttaaagtggc actttttgca agcagtattt taggaatgct ggctgttagt 60 

tcttatacgg cagcagatac agaggataat caggtaacga ttagccatta taatgaacag 120 

gctggaactt ttgatgtcaa cgctgtacag gcagccaacg gaaaaactat tcaatcgata 180 

gatgttgcga tttggtctga agaaaatggg caggatgatt tgaaatggta tcatgccagt 240

aatgatggca gcaatcaatt gacagttcat tttaatgctg agaatcatgg cagtaaggta 300

ggttcttata ttgcgcatgc ttatattacc tatacagatg gtaatcgagt cggggttaat 360



WO 2005/076010 PCT/IN2005/000037 

327/341 

ttgggaaaac gaaaattatc cttatctgca ccgcaattat ccttaaaaca aggtggcctt 420 

caactatttt ctaagctgaa acctagtgca gcggatcaac ttttttcagc agtttggtcg 480 

gatgagaatg gtcaagatga tcttcattgg tacacggcag atgctgacgg gaatactttg 540 

gctggctatg ctaatcataa aggttatgga acttaccatg ttcatactta ccttaagcaa 600

aatggtaaga tgataccaat tagtgctcaa gatattgata ttcctaaacc gaaagtcaag 660

attcagattg ataaaataaa tgataccagt tatgatgttg ttgttaataa tgtcccccct 720

tatattagtt cagtagccat tcctgtgtgg agtgaacaaa atggccaaga cgatttgaaa 780 

tggtatcagg caacaaaagt ggctgatggt atatttaaaa caactgttta tttaaagaca 840 

catcgttttg aattaggcaa ctatcaagct catatttatg gcgatagcca attaagcaag 900 

aaactggatg gtttaggaga aactcatttt aatgttccgt ctattattaa ctatgaagat 960

cctcaggtaa ctattgatca ttataatatt aacaaaggaa cgtttgatgt gactgtagct 1020 

gaaacagata attcaaaagc gatacaatca atcagtgctg ctgtttggtc tgatgctaac 1080

caagctaatc tttattggta tgaagctaaa cagctagcaa atggaaaggc tgcaattact 1140

gtcgatgttc aaaagcatgg caatcaaaca ggaagctaca atgtccatgt ttatgttcat 1200

tataatgatg gcacgactag cggacatgtt ttggctaatc agcagctcaa tcaaattgtc 1260

cattatcaac cttctgcagt aagaataaca gcctatatga atgaaaaaaa tacttatcca 1320

gttggtcagt gtacttgggg agtgaaagaa ttagctcctt ggatacctaa ttggcttggc 1380

aatggcgggc agtgggcaag tactgtagct gttaagggat tcaaaatagg aactgttcct 1440

aaagttggtg ctattgcttg ttggagtgat ggtggttatg gccatgttgc ttacgttacc 1500

cacgttgaga gtaataaccg tattcaagtg aaagaagcta attataagaa tcaacaatat 1560

atttccaatt ttcgcggatg gtttgatccc acgacttcct atttgggaag attaacttat 1620

atttatcctg actaa 1635
<212> Type : DNA 
<211> Length : 1635 

SequenceName : SEQ ID 737

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString : 

atggcaaata actattctcg tcgtcaacaa cccactaaaa aaacaaaggg gacaagtcgg 60 
aaacgtccga ctgaacatat caaaacaggt ttttcagcac tgcaaaagag tgttgctatt 120

atcgctggta ttttagggat tattaccgct ttgattacta ttaataatta tcgcaatagt 180

tcacacaatg ataaaaaaga ttccacatct aaaaccacta tcatcaaaga aaaagaagtg 240
gatgactcaa atagtaacaa caatgctgct aattctcaag ctgaaaatga cagcaataac 300

aataataatt ctgcagaatc aaatcaaaac caaactgcaa caacagcaaa tgacagtaac 360

agcaattcgg ctaatcaaaa tcaagccaat agccaatcac aagcaaataa tcagcaaaat 420 
caaaacaatg ctaatgctgg tcaataa 447 

<212> Type : DNA

<211> Length : 447 

SequenceName : SEQ ID 738 
SequenceDescription : 

Sequence



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString : 

ttgaaaatat ttagttttgg aaccattcga aacaacacag ctctaaaacc taactatgat 60 
gacacaacag cttttagcgg ttttggaacc attcgaaaca acacagctct aaaacagagc 120

actaactgcg ctagctggtt caatcgtttt ggaaccattc gaaacaacac agctctaaaa 180 
ctcaccatat taattaatgg cgtttccttt tgttttggaa ccattcgaaa caacacagct 240 
ctaaaacctc gtggaccaat ttttgtctcg acatttcgta atcgcgccat tcatctcagc 300

cagatttcag cttcaaaatg a 321
<212> Type : DNA

<211> Length : 321 

SequenceName : SEQ ID 739 
SequenceDescription : 

Sequence



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString : 

atgaaaagaa aacgaaatct ttactttctt attggtttat ttttgacagt ctttcttttg 60 

ataggatgct caatgcagaa aaaaaccaaa tcagaaagca gctcgacttc tcaaaagact 120

actttacaaa caaaacagtc aagtgaaaaa tcaactgatg ctaagcaaac gacagaagct 180

cattcagaaa gcagtcagtc ttcttctcat tctaataacg aggaaaccct tgctcccatt 240 
gatacaggcg ctgttttaaa ggctgattac agtagtatgg caggaacttg gaaaaatgaa 300

gaaggacaaa cgttgacatt tgatcagcga ggtctgacaa cccctggaat gacagtcagt 360

ctgttgaaca ttgatcaaga cggaaatctt ttgttaaatg ttgagactgg aacaaaaaag 420

aatctaactc tttatattgt gccagccaat aaaaccttat ctaatcaata tttttctaat 480
ggtcaaagcg atgaatccga taaaacaaaa gatcgtattg tttcttctga gagtttaaat 540
agtggcaaat ttacaaaccg agtttattat catgtttcaa ctcattaa 588

<212> Type : DNA
<211> Length : 588 

SequenceName : SEQ ID 740 
SequenceDescription :

Sequence 



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString :

atgacaccta aaaaaatcaa aatagctcta acagctctta tctctttgat gctcgcctta 60 

tttttattct tgtttaatca ccattcagta agagaaaaca gtcagcaaga aaagttaaag 120 

ataagtaaag caagttctaa aaaatcacaa acaagcactt cttctgttat gacaagtagt 180 

cgaaaagcta ctgaacaaac aagccaagca cagactcaaa gtcaatcaca agcagaacaa 240 

agtaacccta atgtgatcct ccccattccg caagaattag tcggcaccta caaaggttcg 300

agtccacaag catctgaaat aacttttacc atttcttcaa atggtcaatt acgtgctcaa 360

gctaattttg atcctgcttc tgatataaat gacgttaccg ccactgttag tggtgttaga 420

aaagtcgggg cagataccta tatttgggag tttgtctctg gtagttcagc tgctctttta 480

ccgggtgtta caggtatagg agggcttgga aagatgcagc ctggtttcat cctaaaaggg 540

gggcaattaa cacctatcat gtttacaggt tctgtagatg gtgaaattga ttattcacat 600

cccaatccct atccagtatc attaaacaag cagtaa 636
<212> Type : DNA
<211> Length : 636

SequenceName : SEQ ID 741 



SequenceDescription :

Sequence 

<213> OrganismName : Streptococcus mutans UA159

<400> PreSequenceString :

atgaaaaaaa taattaatgt tatcgtctta tcactatccg tttttttcct gatagcttgc 60 
agcaatagca gtaccgggga aaaaacaagt caatcatctg aagagactaa ggtccgatta 120 
attgttaaaa cggattctaa taaaaccgat gaaaaagtcg ctttcaaaaa aggtgctact 180 
gttatggatg tcttaaaaga caactataaa gttaaagaaa gcggtggttt tatcactact 240 

attgatggtg tcactcagga taaaaaagca ggtaggtact ggatgtttga tgttaatgat 300

aagctggcat caaaagctgc tgataagatt aaggtcaaaa atggggataa aattgaattt 360

tatttgaaag tttataaagg taagaactag 390 
<212> Type : DNA 
<211> Length : 390 

SequenceName : SEQ ID 742

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString : 

atgtcaaata aaccatggga agaaaaagta actgatgcaa ctactgataa tgaagaaatg 60 

acaagaaatt caaaggatgc tagtattatc agtacaccta ttttaacaat cctattgagc 120

ctctttttct tgattattat tggtatttta ttttttgtac tttatacttc aaatggtgga 180

agcaatgaaa aagcagccac ttcgggtttc tatagttctt ccaaaacggt caaaaaagcc 240

aaaaatgagg caaatagtca aactgatgaa cagacaacag aagcagaaac aagttcaagt 300

gaaacgacaa gttcctcttc agacagtgat ggcgagacaa ttacagttca aggaggtgaa 360 

ggggctgcag caattgctgc gcgtgcgggt atttctgttg ataaactcta tgagctgaat 420 

ccagaacata tgacgcatgg ctattggtat gctaaccctg gagataacat caagattaag 480

taa 483



<212> Type : DNA 

<211> Length : 483 

SequenceName : SEQ ID 743 
SequenceDescription :

Sequence 



<213> OrganismName : Streptococcus mutans UA159
<400> PreSequenceString : 

atgccagata atcgcatgaa ctatagtatt gatagcaata tgcagtttcc cttggtagaa 60 

attactttgg aaacaggaga atttgcttat attcaacgcg gtagcatggt ctatcacaca 120 

cccagtgtca ctctcaatac caaagtcaat ggacgtggtt caggacttgg caagctagta 180 

ggagcaattg gtcgttctgt aacgtctgga gaaagttttt tcattactca ggcagtatca 240 

aatgctagcg atggtaaatt ggccttggcc ccttctatgc cgggccaagt tattgcttta 300

gaattgggag aaaaacaata tcgcctcaat gatggtgctt ttcttgcctt agatggttct 360

gctcaatatc aaatgaaagc tcagagtgtt ggacgtgccc tttttggcgg tcaaggcggt 420

ctttttgtta tgacaacaga aggtcaaggt accttgcttg ctaatagttt tggttctatc 480 

aaaaaaatag aattacagaa tcaggaaatt acaattgaca atgctcatgt tgtagcttgg 540 

agtagggatt tgaactatga cattcatttg gaaaatggct ttatgcaatc gatcggaacc 600

ggtgaaggcg ttgtcaatac tttccgagga acgggtgaaa tttatgtaca aagtcttaat 660
ctgcagragt ttgctggtgt cctacagggt ttcattacca atactaatcg ttaa 714

<212> Type : DNA 
<211> Length : 714 

SequenceName : SEQ ID 744 

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString :

atgaaaaaaa attatttttg gtacggtctg cttgggcttc tcgcacttta ccttattact 60

attgctttta tcccgggatt tcatattttc tttagcaaca tgttgatgtt ggccctgttc 120

tttatgttaa tagctttgag taacaggagt atcttctttt tctttctagc cttaggtttt 180

cttagtatct acttgaaaga tatctttcat tttgactatt ctaccggacc gctttttacg 240 

ggtataatca ttatcggcgt tattttaaac agtttcctca aaccacacta ttcttattct 300 

tataaaggaa atcattattt taatatgaaa caacatgcta actacattga taacgaaaca 360

gatgtctttt taaaaacact tttttctgaa aataccagtt atgtgacttc tcaagaatta 420 

aataaaatta ttattgatac taagtttgga gaacaatctg ttgatctctc tcaagctcaa 480 

tttatgacag attctcccga aattcatata gatgttagct ttggtgaaac caatctgcgt 540

attccaaaca actggaaaat catcaataaa actcactccc cctttgcttc catttcattt 600

tcaggttttc ctagcacaaa tggtgatttt attaacgtta cattgactgg aacagtggct 660

atgggatctc taaacattca atattaa 687
<212> Type : DNA
<211> Length : 687 

SequenceName : SEQ ID 745 

SequenceDescription :

Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 

<400> PreSequenceString :

atgaaatcaa taactaaaaa gattaaagca actcttgcag gagtagctgc cttgtttgca 60 

gtatttgctc catcatttgt atctgctcaa gaatcatcaa cttacactgt taaagaaggt 120

gatacacttt cagaaatcgc tgaaactcac aacacaacag ttgaaaaatt ggcagaaaac 180

aaccacattg ataacattca tttgatttat gttgatcaag agttggttat cgatggccct 240

gtagcgcctg ttgcaacacc agcgccagct acttatgcgg caccagccgc tcaagatgaa 300

actgtttcag ctccagtagc agaaactcca gtagtaagtg aaacagttgt ttcaactgta 360

agcggatctg aagcagaagc caaagaatgg atcgctcaaa aagaatcagg tggtagctat 420

acagctacaa atggacgtta tatcggacgt tacggttcat ggactgctgc taaaaacttc 480 

tggcttaaca atggctggta ttaa 504 

<212> Type : DNA

<211> Length : 504 

SequenceName : SEQ ID 746 
SequenceDescription : 

Sequence



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString :

atgaaacatt cacataaaaa atcatttgac tggtatagca tgcaacaacg ttattctatt 60 

cgtaagtatt actttggtgc agctagtgtc ttgctcggta ccgctttggt attaggtgca 120

gcagctagtg tccaaacagt acaagcggaa gaaaacaaac aagaaactac caatagtatt 180

tctgttggta ggggagaagc agctactaaa ccagcagagg tttctgcgtc taataaagag 240 



aaaacctatg cagctccaac tgtagctaat ccagtagaaa cgactccagt taaaactgaa 300
gaagttacta aaccagcaga aaaagttgaa gaagcaaaag acaaaaaaga ggaagtaacg 360
catcaagatg ccgttgacaa gtcaaaatta ttaacggctc tttcgcgtgc taaaaaatta 420
gaaagcaagt tatatacaga agcaagtgct gctaacttgc aaacaagtat ccaagctggt 480
caaagcttgc ttggaaaagc agatgcaact gaagctgaat tatcagcagc agagtcatct 540
attcaatcat ttattattgg tctagaactt cgttctaact ctaataaaga aactgtatca 600
gaaacgcctg tagcgaagaa agctgatgca gttgaatcaa aagaaggggc taaaccagct 660
gcaacaactg aacgttcagc tgttgatagc gctattttgc caactagcac agctgacaaa 720
gtagaaacaa cttcagctcc agcatctatt aatgaaatct tgaaactagg tttgagcctt 780
tctgatgctc gccaaaatcc agctatccgt aaggaagatg ttaatagagg gtatagtggt 840
tttagagcgg ctagcaatcc agccaaccca attgtctcag gttctggaaa tacagttgca 900
tttgcagata ttagccaagg tggtcgtagc tatagtttcc gtggctacgg gaactcacgt 960
ggtggaaatt ctattcatta cgatgtaaca acagtacgta gtggtaatag tgttaacttt 1020
acaattagtt attctgcgcc aggagattct agagagtttg ttaataataa ttttatcttg 1080
gataaagggg atggatttgg aaatccttca aatgcaacga tcacaagttc aaatccaaga 1140
gtaagggagc aatcaaaatc tattagtcag ggtgccaact acgtatctca ctcaggatac 1200
agtatgacct ctgctatttc aacaaatacg gaacaaacta tcagatttag tttgcctatc 1260
attaatctaa atggtgattt gtctgttcgt ttgaaacctg ttacttttaa cgtggaccaa 1320
ggtggtggtg gtgccgctac tagcaatgac ccatacagta actctaacta ttattataga 1380
gcaaacccac tatacttgga tgcaaaccct tatggtggta ctaataataa gactgtttca 1440
gaagacattg acttccaaac tgtctatctt ccaactagta agttaccaga aggtcaaact 1500
agattagttc gagaaggtga aaaaggacaa cgtcaaatta cctataaagt tcatcgattt 1560
ggtaacgaaa cacttttagg attgccgatt agtaatagtg ttactaaaga agctaagcca 1620
cgtattatgc aaattggtgt ggctaaagat ctaatcgata cagtaaaacc acgtgttgat 1680
caaaataaag tcggtgatac aaataacctc actttctatc ttgataacga tggaaacggt 1740
gtttatactg aaggtgtaga cgaacttgtt caaaaaattg ctattaaaga tggagctaaa 1800
ggtgaaaaag gagaccaagg tgaacgcggt ctaactggag ctaaaggtga aaaaggagac 1860
cgaggcgaac gcggtctaac tggagctcaa ggagctaaag gtgaaaaagg agaccgaggt 1920
gaacgcggtc taactggagc tcaaggagct aaaggtgaaa aaggagaccg aggtgaacgc 1980
ggtctaaccg gagctcaagg agctaaaggt gaaaaaggag accgaggcga acgcggtcta 2040
actggagctc aaggagctaa aggtgaaaaa ggagcccaag gtgaacgcgg tctaaccgga 2100
gcacaaggag ctaaaggtga aaaaggggac caaggtgaac gcggtctaac cggagcacaa 2160
ggtgccaaag gtgaaaaagg ggaccaaggt gaacgcggtc taaccggagc acaaggagct 2220
aaaggtgaaa aaggagccca aggtgaacgc ggtctaactg gaactcaagg agctaaaggt 2280
gaaaaaggag accgaggcga acgcggtctg actggagccc aaggagctaa aggtgaaaaa 2340
ggagaccgag gcgaacgcgg tctgactgga gcccaaggtg ctaaaggtga aaaaggagcc 2400
caaggtgaac gcggtctaac cggagcccaa ggtgctaaag gtgaaaaagg ggaccaaggc 2460
gaacgcggtc taactggagc acaaggtgaa aaaggagacc gaggcgaacg cggtttaact 2520
ggagctaaag gtgaaaaagg ggaccaaggc gaacgtggta tcactggagc taaaggtgaa 2580
aaaggagccc aaggtgaacg cggtctaacc ggagcccaag gtgctaaagg tgaaaaaggg 2640
gaccaaggtg aacgcggtct cactggagcc caaggtgaaa aaggagccca aggtcaggca 2700
ggtcgtgacg gtgtaactcc aaccgtaacc gttaaagata ataaaaatga cggcactcat 2760
actatcacta ttaacgacgg tagaggtaat gttacaagta ctgttgtaag agatggtttc 2820
gatggcgcaa gtccattagt tgcgactcaa cgaaatgacg cagataagac aacaactgtt 2880
atcttctatt acgataaaaa tggaaacaat gaattagatg cttctgataa gaaattaaaa 2940
gaagttgtaa ttgcagatgg tgctaaaggt gaaaaagggg acaaaggtga acaaggtctt 3000
caagggcgtg atggtgaaca aggaccaaaa ggagaagatg ggaaaacccc aacagttaaa 3060
gtaactgatg gtcaagatgg aacgcataca attacaatta acgatggtaa aggtggtata 3120
actactacag tagtaagaga tggatttgat ggtgcaagcc ctcttgtttc tactcataga 3180
aatgaagcag ataaaacaac aactgttatt ttctattatg atctaaatga taataatcaa 3240
tttgatgaag gagatacaaa acttaaagaa gttgttatcg cagatggaaa acaaggacca 3300
aaaggtgaca aaggtgataa cggaaaagat ggattcacac cagaagtaac agttacagat 3360
aacaataacg gaacacacac catcacaatc acacaaccag acaacagacc atcattaaca 3420
acaatcgtta aaaacggtga agatggaaaa acaccaaaag tcaaagcaga acgagatgat 3480
gcgaagaaac aaacaacatt aacattctac attgacaaag atggagatgg aagttacaca 3540
gcaggaaaag acgagttagt tcaaacaaca gtagttaaag acggacaaga cggagctgca 3600
ggagcatctg gacgtgatgg aaaagaagta ttaaacggaa aagtagaccc aacaacagaa 3660
ggtaaagatg gagacacatt cgtaaataca caaacaggag atgtattcgt taagaaaggt 3720
aacacttggg aaccagcagg aaacatcaaa ggaccgaaag gtgacaaagg tgcagatggt 3780
gccaagggtg aaaaaggagc ccaaggagaa cgcggcctga ctggagcgca aggtgtcaag 3840
ggtgaaaagg gagatcaggg tgaacgcggt ctaactggat ctaaaggtga aaaaggagac 3900
caaggtgaac gcggtttaac tggagcgcaa ggtgccaaag gtgacaaagg tgaacaaggt 3960
cttcaaggtc gtgatggagc tcaaggacca aaaggagcag atggacaaag aggaccagcg 4020
ggaccacaag gaccaaaagg agaacaaggt aatccaggaa ctccaggtaa agatggaaaa 4080
tctctaattg ctgttaaaaa tggtgtatta gtaacaatta caccagtaga aggtcgtcca 4140
caaacaacat ttgtagagga tggacaaaag ggtgctgatg ggaaaactcc aacagtaaca 4200
ataactgagg ggcaaaacgg cacacataca ttaacggttc ataatccagg aagtccagat 4260
gtgacaacta cgatccgtga tggagctaca ggacaagcag gtcgtgatgg taaagatgta 4320
ttaaacggaa aagtaaatcc acaaccaaac caaggtaaaa atggagataa atatattaat 4380
atcgaaaccg gtgatgtcta tgttaaaaac aatggaaact gggataaaga aggcaacatc 4440
aaaggcccta aaggtgacaa aggtgcagat ggtgctaaag gcgaaaaagg agaccaaggc 4500
gaacgcggcc taactggagc gcaaggagct aaaggtgcgg atggcgcagt aggtcgtgat 4560
ggacgtgacg gtaaagacgt gttgaacggc aaagctaacc cagaagcaca tcaaggtaaa 4620
gacggcgata aatacgttaa tacagaaaca ggcgacgtct tcgttaagaa taacggcaac 4680
tgggataaag agggcaacat caaaggccct aaaggtgaca aaggtgcaga tggtgctaaa 4740
ggcgaaaaag gagaccgagg cgaacgcggc ctgactggag cgcaaggagc taaaggtgcg 4800
gatggagcag caggtcgtga cggacgtgat ggacgtgacg gtaaagacgt gttgaacggc 4860
aaagttaacc cagaagcaaa tcaaggtaaa gacggcgata aatacgttaa tacagaaaca 4920
ggcgacgtct tcgttaagaa taacggcaac tgggataaag agggcaacat caaaggctct 4980
aagggtgaca aaggtgaacg cggagaagat ggtaagactc cagaagtaac tgtaactcca 5040
ggtaaagatg gccatagtac tgacattaca ttcactgttc caggtaaaga tccagttaca 5100
gtaaatgtta aggacggaga aaatggtctg aacggtaaaa ctccaaaagt tgatttactt 5160
cgtgtccaag gaaaaaacgg aaatccatct catacaattg tgacattcta tacagatgaa 5220
aacaatgacg gcaaatatac accaggaact gatgaacttc taggttcaga aatgattaaa 5280
gatggtgcta aaggcgcgga cggacgagat ggtaaatcat tgcttactgt caaggatggt 5340
aaagaaacta aagtttacca agaagatcca gctaacccag gacaaccatt aaatccagaa 5400
aaaccacttg cggtaattag agatggagta gatggaaaat cacctacagt tacagctgtt 5460
cgtaaagatg aagcagggca taaaggtgta gaaatcactg ttgataacca tgatggttca 5520
caaccaacta cagtctttgt tcaagatggt gctaagggaa aaactggtgc aaccggtcag 5580
gatggacaaa ctcctacaat cactactcaa cgtggacaag atggccaaag cactgttgta 5640
actatcacaa catcaggtaa agatccagta accttcactg taaaagatgg taagaatggt 5700
aaagatggcc gtgcaccgaa aatcaaagta gaagatatta cttcaccttc aagaattaga 5760
cgcgatacag atgctgctgc aactccaacg cgtaacggta tccgtgttac agtttatgat 5820
gatgttaatg acaatggggt atacgacgaa ggtgtcgata aagtattaaa tagtaaagat 5880
atttataacg gtatagatgg acgtgatggt tcagctccaa ctattactac aaaagataat 5940
ggagatggaa ctcacactat cacagttcaa aacccagatg gttctgaatc aacaacagtt 6000
gttaaagatg gtaaagacgg taaaactgcg aatatcacta caacagaaaa cccagatgga 6060
agccacacaa ttacggtgac aaatccagat ggttcaacta aagaaactgt tgttaaaaac 6120
ggtaaagacg gtaagactcc taaagttgaa gtaacggata acaacgatgg aactcatact 6180
gttaaagtta cagatggaga cggcaatgtt accaacgcta tcatcaaaga tggtaaagac 6240
ggtaaagctg caacagcaac aactactgaa aatccagatg gaagccacac agtaacaatc 6300
actaacccag acggaactaa gaatgagttt gttgttaaga atggacgtga cggtgttgac 6360
ggacgtactc caaccgcatc tgttcgtgat aatggagacg gaagtcatac aatcgttatt 6420
acaaatccag aaggtgtgac aactgaaacc acagttcgtg atggtaaatc accaaaagtg 6480
actataactg atgaacaaaa tggaactcat aagatctctg ttctaaatgg tgacggaaca 6540
actactgaaa caatcattaa agatggtaaa tcaccagtag caacagttag agataaccaa 6600
gatggtactt acactattcg tgtggaaaac ggtaatggta ctgtttctga aaccacagtt 6660
cgtgacggta aatcaccaac tgctaaggtt gtggataatg gagatggaac tcacactatc 6720
acagttgtga actcagacgg aataactaca acaactacag ttcgtgatgg tagagaacca 6780
aaacttgaag ttattgataa caacgatggt tcacacacta ttaaagtgac aggtgctgat 6840
ggtaaaggaa cgacaactac aatctttgat ggtaaatcac caaaagcgaa catcgttgat 6900
aacggagatg gaactcatac attaacaatc gtagattctg atggtcgtga atacaaatct 6960
attatcaaag atggtaaaga cggcaaagat agcgtttcac caactgtaac tgttaaaaat 7020
aataacgatg gaactcacgt tgttacaatc actaatccag atggaagtaa gacagaaatg 7080
gtgattaaag acggtaaaga tggtaaatca ccaaaagttt ctgttgaaga taatggtgat 7140
ggtagtcata caatcacaat catcaattct gatggaactg tgacaaaaac agttattaaa 7200
gatggcaaag atggtagaga tggacgtgat ggtcgagacg gcaaagacgg taaagatgga 7260
aaatgtggat gccaagacaa accagtaaca ccatcaaatg acaaaccagt tcctccaaca 7320
ccaaatgtgc cgacaccaga agtaccggtt aaacctgtgc cagcgcaacc aacaccaaat 7380
gtaccgacac cagaagtgcc agtacaacca actccagctg tttcaacacc agaagtaccg 7440
gttaaaccag taccagcggt tccagaacaa ccagtagtac caacaccggc tcaaccagca 7500
actccagtaa atgctaaccc agtagcacca actacaggta aagaaaaccg tggggacaaa 7560
ttacctgaaa ctggaagcca atctgattat atctctgttc ttttaggtag cggtattcta 7620
ttgagcctat atgtaggacg aagaaaagaa gattaa 7656
<212> Type : DNA
<211> Length : 7656
SequenceName : SEQ ID 747

SequenceDescription :



Sequence 



<213> OrganismName : Streptococcus pneumoniae R6
<400> PreSequenceString :

atgaagaaaa gaatgttatt agcgtcaaca gtagccttgt catttgcccc agtattggca 60 
actcaagcag aagaagttct ttggactgca cgtagtgttg agcaaatcca aaacgatttg 120

actaaaacgg acaacaaaac aagttatacc gtacagtatg gtgatacttt gagcaccatt 180

gcagaagcct tgggtgtaga tgtcacagtg cttgcgaatc tgaacaaaat cactaatatg 240

gacttgattt tcccagaaac tgttttgaca acgactgtca atgaagcaga agaagtaaca 300

gaagttgaaa tccaaacacc tcaagcagac tctagtgaag aagtgacaac tgcgacagca 360

gatttgacca ctaatcaagt gaccgttgat gatcaaactg ttcaggttgc agacctttct 420 

caaccaattg cagaagctcc aaaagaagta gcatcaagtt cagaagttac aaagacagtg 480

attgcttctg aagaagtggc accatctacg ggcacttctg tcccagagga gcaaacggcc 540 

gaaacaagca gtgcagttgc agaagaagct cctcaggaaa cgactccagc tgagaagcag 600 

gaaacacaaa caagccctca agctgcatca gcagtggaag caactacaac aagttcagaa 660

gcaaaagaag tagcatcatc aaatggagct acagcagcag tttctactta tcaaccagaa 720

gaaacgaaaa taatttcaac aacttacgag gcaccagctg cgcccgatta tgctggactt 780

gcagtagcaa aatctgaaaa tgcaggtctt caaccacaaa cagctgcctt taaagaagaa 840
attgctsact tgtttggcat tacatccttt agtggttatc gtccaggaga cagtggagat 900

cacggaaaag gtttggctat cgattttatg gtaccagaac gttcagaatt aggggataag 960

attgcggaat atgctattca aaatatggcc agccgtggca ttagttacat catctggaaa 1020 

caacgtttct atgctccatt cgatagcaaa tatgggccag ctaacacttg gaacccaatg 1080

ccagaccgtg gtagtgtgac agaaaatcac tatgatcacg ttcacgtttc aatgaatgga 1140 

taa 1143 

<212> Type : DNA

<211> Length : 1143 

SequenceName : SEQ ID 748 
SequenceDescription : 



Sequence



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

gtgacgattc taggaaaaga tacagttcaa caatctgcga aaggtgaatc tgtaactcaa 60 

gaagctacac cagagtataa gctagaaaat acaccaggtg gagataaggg aggcaatact 120

ggaagctcag atgctaatgc gaatgaaggc ggtggtagcc aggcgggtgg atcagctcac 180 

acaggttcac aaaactcagc tcaatcacaa gcttctaagc aattagctac tgaaaaagaa 240 

tcagctaaaa atgccattga aaaagcagcc aagaacaagc aggatgaaat caaaggcgca 300 

ccgctttctg ataaagaaaa agcagaactt ttagcaagag tggaagcaga aaaacaagca 360 

gctctcaaag agattgaaaa tgcgaaaact atggaagatg tgaaggaagc agaaacgatt 420

ggagtgcaag ccattgccat ggttacagtt cctaagagac cagtggctcc taatgctgct 480 

cctaagacaa caagtgcacc gcaagcaact gcaggaacaa tgcaagatgt tacctaccag 540

tcacctgctg gcaaacaatt acctaacaca ggttcagcat caagtgcagc acttgctagt 600

cttggtctag tggtggcaac aagtggtttt gctttgctag gaagaaagac tagacgtaga 660

aaatag 666



<212> Type : DNA 

<211> Length : 666 

SequenceName : SEQ ID 749 
SequenceDescription : 


Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

atgatgacga caggttgctc tatgggagcc tatcatgcac tcaatttctt cctccagcat 60

ccagatgtct ttaccaaagt gattgctctc agtggtgttt acgacgcacg tttctttgtc 120

ggtgattact acaacgatga tgctatttac caaaactcgc cagtagatta tatttggaac 180

caaaacgacg gctggtttat tgaccgttac cgtcaggcag agattgtgct gtgtacgggg 240

cttggagcct gggaacaaga tggtttgcca tccttttaca agctcaaaga agcctttgac 300

aagaaacaaa ttccagcctg gtttgctgaa tggggacatg atgtcgccca tgactgggaa 360

tggtggcgta aacaaatgcc ttatttcctc ggtaatctct atttataa 408 
<212> Type : DNA 
<211> Length : 408

SequenceName : SEQ ID 750 

SequenceDescription :

Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString :

atgaataaag gattatttga aaaacgttgt aaatatagta ttcggaaatt ttcattaggt 60 
gttgcttctg ttatgattgg agctacattc tttgggacaa gtccggttct tgcagatagc 120
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gtgcagtctg- gttccacggc gaacttacca 
aaagagaatg- atgggcatga ttttgaagcg 
gaagttaca.cj atggacctaa gacagaagaa 
gctgaagaaa. aaccaaaaga ggataaacct 
5 gtaacccctcg aatggcaaac ggtagagaaa 
gaagaaaaag gtgtccgcta caaGcaatta 
aaaccagccc tgtttgaaaa gaagggcttg 
gatttaacct tcaaagatga ttctgaaaag 
tttaaagata. ccaagaataa tgtttttgtc 
10 tataaatctc caacaactag cacttggtat 
ggatcaacaa accgtctctc tatcactctc 
aacgatgtca. atctctttga cacagtgact 
aatgagaaga. agattcttct caaggcgggc 
gttaaaacgcj ataaccaaga gggggtaaaa 
15 ggtcctgaa^ ttgatgatag caaggtgact 
gcagtgattg accaagcctt ccctcgtgtc 
ccaggacagg ttcaacagtt caaccaagtc 
gtcacttata. agaaaatcaa tgagacaaca 
gctcacttaa. tcaatgcgga aatgacagta 
20 tttgatgtgea ccaagattgt caaccacaat 
gaaagaaaac tactttcttc tattagtttc 
gatcaaactcj gtgctaagtt tgatggggca 
gatgatcata. tcgatgtaac caatccaatg 
tttgtttctsi cagataagct tgctgctggt 
25 ggtggttcga. atgactggac tcgtttgaca 
tatgtaggaa tccacagctc tgaatggcaa 
ccagaataca. cgaaggaact tccaagtgct 
gacaagaaag- tcgattggca ggatggtgcc 
caaggttgga. aaaaagttaa ggatatcaca 
30 caagcacaaa acccattect tatgaccttg 
gatggtcttg- ggcaaggtgt tctccttaaa 
cacttgaact atgctgatafc tggtaagcgt 
attgagaagg ctaagaaata tggagctcat 
tatcctgagfc ctaaatactt caatgaaaaa 
35 agctatggtt ggaactggct agatcaaggt 
catggtcgtt tggcacgttg ggaagatttg 
atcfcatgtgg acgtttgggg taatggtcaa 
gttcttgcta aagaaattaa caaacaaggc 
ggtgagtacg actctacctt ccatcactgg 
40 aataaaggta tcaacagtgc catcacccgc 
gtaggggact acagaagtta tggtggtgca 
atgaaagact ttgaaggctg gcaaggaaga 
tttgcccatg acgtcatgac caagtacttc 
ggtacaccgg tgactatgac cgataacggt 
45 gtggaattgg tagatgctga caataataaa 
aatagtccac aatatcgcga acgtacagta 
tcagcttact fcgactccttg gaactgggat 
gaaaagatgfc. actacttcaa tacgcaggcc 
tgggcaaaga gcaaggttta cctttacaag 
50 gaactaactg taaaagatgg taaaattacc 
ctctatcgtt cgaaacaaac caatcctgaa 
gaccaaggat ttaacagtgg taccttgaaa 
gcagaaattg tcaagtctca aggggcaaac 
aaagttagtc tcactcagaa attaactggc 
55 gtcggtgtcg- ataaccgtag taatgccaag 
gaagtgacta cttataccaa taagtctctc 
aatacacgtc gtaacaatgc tacagttgac 
ttctttacaa ctggatcgga cgtatcaaat 
gatgaagcaa cttactttga tgaaattcgt 
60 gacaagcatg atacaggtaa aggcaccttc 
atcttcccat ttgtagtggg tggtgtcgaa 
gaaaaacacg atccatatac acaacgtggt 
gaaggaaatt ggtcactcaa gacaaatgga 
actattccgc aaaacttccg ttttgaagca 
65 gaagcaggtt cagacaatac ctatgctttt 
cgtcgtggta ctcaagcaag caacttggaa 
tctaagaaag ccaagaaggc aaccttcctc 



gctgatttag ctactgctct tgcaacagca 180 

cctaaggtgg gagaagacca aggttctcca 240 

gaactattag cacttgaaaa agaaaaaccg 3 00 

gcagctgcta aacctgaaac acctaagacg 3 60 

aaagaacaac agggaacagt cactatccga 42 0 

tcctcaactg ctcaaaatga taacgcaggt 480 

accgttgatg ccaatggaaa tgcaactgtt 54 0 

ggcaaatcac gctttggtgt cttcttgaaa 60 0 

ggttacgaca aggatggctg gttctgggag 660 

agaggtagtc gtgttgctgc tcctgaaaca 720 

aagtcagacg gtcagctaaa tgccagcaat 78 0 

ctaccagctg cggtcaatga ccatcttaaa 840 

tcttatgacg atgagcgaac agttgttagc 90 0 

acagaggata cccctgctga aaaagaaaca 360 

tatgacacga ttcagtctaa ggttctcaaa 102 0 

aaggaataca gcttgaatgg acatacfcttg 1080 

tttatcaata accaccgaat cacccctgaa 1140 

gcagagtact tgatgaagct tcgcgatgat 12 00 

cgcttgcaag ttgtggacaa tcaattgcac 1260 

caagtcactc caggtcaaaa gattgatgac 132 0 

ctcggcaatg ctttagtctc tgtttctagt 13 80 

accatgtcaa acaatacgca tgtcagcgga 1440 

aaagatctag ccaagggtta catgtatgga 1500 

gtttggagta actctcaaaa cagctatggt 1560 

gcctataaag aaacagtcgg aaatgccaac 1620 

tgggaaaaag cttataaggg cattgttttc 1680 

aaggttgtta tcactgaaga tgccaatgca 1740 

attgcttatc gtagcattat gaacaatcct 18 00 

gcttaccgta tcgcgatgaa ctttggttct 18 60 

gatggtatca agaaaatcaa tctccacaca 192 0 

ggatatggta gcgaaggcca fcgactctggt 1980 

atcggtggtg tcgaagactt caagacccta 2040 

ctaggtatcc acgttaacgc ttcagaaact 2100 

attctccgta agaatccaga tggaagctat 2160 

atcaacattg atgctgccta tgacctagct 2220 

aagaaaaaac ttggtgacgg tctcgacttt 2280 

tcaggtgata acggtgcctg ggctacccac 2340 

tggcgctttg cgatcgagtg gggccatggt 2400 

gcagctgact tgacctacgg tggctacacc 2460 

tttatacgta accaccaaaa agatgcttgg 2520 

gccaactatc cactgctagg tggctacagc 258 0 

agtgactaca atggctatgt aactaactta 2640 

caacacttca ctgtaagtaa atgggaaaat 2700 

agcacctata aatggactcc agaaatgcga 2760 

gtagttgtaa ctcgtaagtc aaatgatgtc 282 0 

actctcaacg gacgtgtcat ccaagatggt 2880 

gcaaatggta agaaactttc tactgataag 2940 

ggtgcaacaa cttggaccct tccaagcgat 3 000 

ctaactgacc aaggtaagac agaagagcaa 3 060 

ctagatcttc tagcaaatca accatacgtt 312 0 

atgtcatgga gtgaaggcat gcacatctat 3180 

cattggacca tttcaggcga tgcttctaag 3240 

gatatgcttc gtattcaagg aaacaaagaa 33 00 

ttgaaaccaa ataccaagta tgccgtttat 33 60 

gcgagcatca ctgtaaatac tggtgaaaaa 342 0 

gccctcaact atgtaaaagc ctatgcccac 348 0 

gatacaagtt acttccaaaa catgtacgcc 3540 

gttactctga cattgagtcg tgaagctggt 3 600 

acctttgaaa acaattcaag catgtacgga 3 660 

aagcaagact ttgaaaatgt tgctcagggt 372 0 

ggtgtcgaag acaaccgcac tcacttgtct 3780 

tggaatggta agaaagtcga tgatgttatc 3840 

ctagtgagcc gtcgtaactt ggtttaccaa 390 0 

ggtaagacct accgtgtaac ctttgaatac 3960 

gtagtcggta agggagaatt ccagtcaggt 4020 

atgcatgaat tgccaaatac ttggacagat 4080 

gtgacaggtg cagaaacagg ggatacttgg 4140 



gtaggtatct actcaactgg aaatgcaagt aatactcgtg gtgattctgg tggaaatgcc 4200
aacttccgtg gttataacga cttcatgatg gataatcttc aaatcgaaga aattacccta 4260
acaggtaaga tgttgacaga aaatgctctg aagaactact tgccaacggt tgccatgact 4320
aactacacca aagagtctat ggatgctttg aaagaggcgg tctttaacct cagtcaggcc 4380
gstgatgata tcagtgtgga agaagcgcgt gcagagattg ccaagattga agccttgaag 4440
aatgctttgg ttcagaagaa aacggctttg gtagcagatg actttgcaag tcttacagct 4500
cctgctcagg ctcaagaagg tcttgcaaat gcctttgatg gaaacttatc tagtttatgg 4560
catacatcat ggggcggagg agatgtaggc aagcctgcaa ccatggtctt gaaagaagca 4620
actgaaatca ctggacttcg ttatgttcca cgtggatcag gttcaaatgg taacttgcgt 4680
gatgtgaaac ttgttgtgac agatgagtct ggcaaggagc atacctttac tgcaactgat 4740
t^gccagata acaataagcc aaaagacatt gattttggta agacaattaa ggctaagaaa 4800
afctgtcctta caggtactaa gacttacgga gatggtggcg ataaatacca atctgcagcg 4860
gaactcatct ttactcgtcc acaggtagca gaaacacctc ttgacttgtc aggcfcatgaa 4920
gcagctttgg ctaaggctca gaaattaaca gacaaagaca atcaagagga agtagctagc 4980
gttcaggcaa gcatgaaata tgcgacggat aaccatctct tgacggaaag aatggtggaa 5040
tactttgcag attatctcaa ccaattaaaa gattctgcta cgaaaccaga tgctccaact 5100
gfcagagaaac ctgagtttaa acttagctct gtagcttccg atcaaggtaa gacgccagat 5160
tataagcaag aaatagctag accagaaaca cctgaacaaa tcttgccagc aacaggtgag 5220
agtcaatttg acacagccct cttcctagca agtgttagcc tagccctatc tgctctcttt 5280
gtagtaaaaa cgaagaaaga ctag 5304
<212> Type : DNA
<211> Length : 5304

SequenceName : SEQ ID 751 
SequenceDescription : 






Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString :

atgaaattat ataataaatc agaattacgt tattctcgca ttttctttga taagagacca 60
ccragcttttg cgtttattct cattatttca actgctatta tcttaagcgg tgcattggtt 120
ggcgcagctt atatacccaa aaactatatt gtaaaagcga atggaaattc agtcataaca 180
ggtacagagt tcctatcagc aattagttca gggaaggtag ttactttaca taagtcagaa 240
ggagatatgg taaatgctgg agatgtcatt atttcgttat caagtggaca agaaggttta 300
caagcgagct ctttaaataa acaattggtg aagttacgtg caaaagaagc tatctttcaa 360
aaatttgaac aatcattaaa tgagaaatac aaccgtatgt ctaattctgg tgaagaacag 420
gaatattatg ggaaagttga atactaccta tctcagttaa attcggaaaa ttataataat 480
ggtacccagt attcaaagat tcaggatgaa tatacgaagt tgaataaaat aacagctgaa 540
agaaatcagt tagatgccga cttgcaaact ctacaaaatg aattgattca acttcaacag 600
caaggagact ccccttcctt atcagatacg acatcagctg atgataaagc taagttagaa 660
actaagatat tagaaataac aacaaaaatc gaagcattaa aaacaaatat tacttctaag 720
aatagcgaga ttgatagtca acaaagcaat attaaagata tgaaccgtac ctataatgat 780
ccaacttctc aggcttataa tatttatgct caattagtta gtgagttagg tactgctcgt 840
fccaaacaaca ataaaagtat tacagagctt gaggctaatc ttggagtggc aacaggtcaa 900
gataaagctc atagtatatt agcgccaaat gaaggtactc tgcattattt ggtacctttg 960
aaacaaggaa tgtctattca gcaggggcaa acgatagcag aagtttcagg gaaagaaaaa 1020
ggttactatg tagaagcttt tgtacttgcg agtgatattt ctcgtgtctc aaaaggagca 1080
aaagttgatg ttgctattac tggtgtaaat agtcaaaaat atggaacact aaagggacaa 1140
gtcagacaga ttgattcagg aacaatttcc caagaaacga aagaggggaa tattagcctc 1200
tataaagtca tgatagaatt agaaacctta actctaaaac atggaagtga gacagtcgtc 1260
ctccaaaagg atatgccagt tgaagtgcgg attgtctatg ataaagaaac ctatcttgat 1320
tggattttgg aaatgttaag tttcaagcaa taa 1353
<212> Type : DNA
<211> Length : 1353
SequenceName : SEQ ID 752

SequenceDescription : 



Sequence

<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString :

atgaataaag gtttacatcg cattatcttt agtaaaaagc acagcaccat ggttgcagta 60
gccgaaactg ccaacagcca gggcaaaggt aaacaggcag gcagttcggt ttctgtttca 120
ctgjaaaactt caggcgacct ttgcggcaaa ctcaaaacca cccttaaaac cttggtctgc 180
tctttggttt ccctgagtat ggtattgcct gcccatgccc aaattaccac cgacaaatca 240
gcacctaaaa accagcaggt cgttatcctt aaaaccaaca ctggtgcccc cttggtgaat 300
atccaaactc cgaatggacg cggattgagc cacaaccgct atacgcagtt tgatgttgac 360
aacaaagggg cagtgttaaa caacgaccgt aacaataatc cgfcttctggt caaaggcagt 420
gcgcaattga ttttgaacga ggtacgcggt acggctagca aactcaacgg catcgttacc 480
gtaggcggtc aaaaggccga cgtgattatt gccaacccca acggcattac cgttaatggc 540
ggcggcttta aaaatgtcgg tcggggcatc ttaactatcg gfcgcgcccca aatcggcaaa 600
gacggtgcac tgacaggatt tgatgtgcgt caaggcacat tgaccgtagg agcagcaggt 660
tggaatgata aaggcggagc cgactacacc ggggtacttg ctcgtgcagt tgctttgcag 720
gggaaattac agggtaaaaa cctggcggtt tctaccggtc ctcagaaagt agattacgcc 780
agcggcgaaa tcagtgcagg tacggcagcg ggtacgaaac cgactatfcgc ccttgatact 840
gccgcactgg gcggtatgta cgccgacagc atcacactga ttgccaatga aaaaggcgta 900
ggcgtcaaaa atgccggcac actcgaagcg gccaagcaat tgattgtgac ttcgtcaggc 960
cgcattgaaa acagcggccg catcgccacc actgccgacg gcaccgaagc ttcaccgact 1020
tatctctcca tcgaaaccac cgaaaaagga gcggcaggca catttatctc caatggtggt 1080
cggatcgaga gcaaaggctt attggttatt gagacgggag aagatatcag cttgcgtaac 1140
ggagccgtgg tgcagaataa cggcagtcgc ccagctacca cggtattaaa tgctggtcat 1200
aatttggtga fctgagagtaa aactaatgtg aacaatgcca aaggctcggc taatctgtcg 1260
gccggcggtc gtactacgat caatgatgct actattcaag cgggcagttc cgtgtacagc 1320
tccaccaaag gcgatactga attgggtgaa aatacccgta ttattgctga aaacgtaacc 1380
gtattatcta acggtagtat tggcagtgct gctgtaattg aggctaaaga cactgcacac 1440
attgaatcgg gcaaaccgct ttctttagaa acctcgaccg ttgcctccaa catccgtttg 1500
aacaacggfca acattaaagg cggaaagcag cttgcttfcac tggcagacga taacattact 1560
gccaaaacta ccaatctgaa tactcccggc aatctgtatg ttcatacagg taaagatctg 1620
aatttgaatg ttgataaaga tttgtctgcc gccagcatcc atttgaaatc ggataacgct 1680
gcccatatta ccggcaccag taaaaccctc actgcctcaa aagacatggg tgtggaggca 1740
ggcttgctga atgttaccaa taccaatctg cgtaccaact cgggtaatct gcacattcag 1800
gcagccaaag gcaatattca gcttcgcaat accaagctga acgcagccaa ggctctcgaa 1860
accaccgcat tgcagggcaa tatcgtttca gacggccttc atgctgtttc tgcagacggt 1920
catgtatcct tattggccaa cggtaatgcc gactttaccg gtcacaatac cctgacagcc 1980
aaggccgatg tcaatgcagg atcggttggt aaaggccgtc tgaaagcaga caataccaat 2040
atcacttcat cttcaggaga tattacgttg gttgccggca acggtattca gcttggtgac 2100
ggaaaacaac gcaattcaat caacggaaaa cacatcagca tcaaaaacaa cggtggtaat 2160
gccgacttaa aaaaccttaa cgtccatgcc aaaagcgggg cattgaacat tcattccgac 2220
cgggcattga gcatagaaaa fcaccaagctg gagtctaccc ataatacgca tcttaatgca 2280
caacacgagc gggtaacgct caaccaagta gatgcctacg cacaccgtca tctaagcatt 2340
accggcagcc agatfctggca aaacgacaaa ctgccttcfcg ccaacaagct ggtggctaac 2400
ggtgtattgg cactcaatgc gcgctattcc caaattgccg acaacaccac gctgagagcg 2460
ggtgcaatca accttactgc cggtaccgcc ctagtcaagc gcggcaacat caattggagt 2520
accgtttcga ccaagacttt ggaagafcaat gccgaattaa aaccatfcggc cggacggctg 2580
aatattgaag caggtagcgg cacattaacc atcgaacctg ccaaccgcat cagtgcgcat 2640
accgacctga gcatcaaaac aggcggaaaa ttgctgttgt ctgcaaaagg aggaaatgca 2700
ggtgcgccta gtgctcaagt ttccfccattg gaagcaaaag gcaatatccg tctggttaca 2760
ggagaaacag atttaagagg ttctaaaatt acagccggta aaaacttggt tgtcgccacc 2820
accaaaggca agttgaatat cgaagccgta aacaactcat tcagcaatta ttttcctaca 2880
caaaaagcgg ctgaactcaa ccaaaaatcc aaagaattgg aacagcagat tgcgcagttg 2940
aaaaaaagct cgcctaaaag caagctgatt ccaaccctgc aagaagaacg cgaccgtctc 3000
gctttctata ttcaagccat caacaaggaa gttaaaggta aaaaacccaa aggcaaagaa 3060
tacctgcaag ccaagctttc tgcacaaaat attgacttga tttccgcaca aggcatcgaa 3120
atcagcggtt ccgatattac cgcttccaaa aaactgaacc ttcacgccgc aggcgtattg 3180
ccaaaggcag cagattcaga ggcggctgct attctgattg acggcataac cgaccaatat 3240
gaaattggca agcccaccta caagagtcac tacgacaaag ctgctctgaa caagccttca 3300
cgtttgaccg gacgtacggg ggtaagtatt catgcagctg cggcactcga tgatgcacgt 3360
attattatcg gtgcatccga aatcaaagct ccctcaggca gcatagacat caaagcccat 3420
agtgatattg tactggaggc tggacaaaac gatgcctata ccttcttaaa aaccaaaggt 3480
aaaagcggca aaatcatcag aaaaaccaag tttaccagca cccgcgacca cctgattatg 3540
ccagcccccg tcgagctgac cgccaacggt atcacgcttc aggcaggcgg caacatcgaa 3600
gctaatacca cccgcttcaa tgcccctgca ggtaaagtta ccctggttgc gggtgaagag 3660
cfcgcaactgc tggcagaaga aggcatccac aagcacgagt tggatgtcca aaaaagccgc 3720
cgctttatcg gcatcaaggt aggtaagagc aattacagta aaaacgaact gaacgaaacc 3780
aaattgcctg tccgcgtcgt cgcccaaact gcagccaccc gttcaggctg ggataccgtg 3840
ctcgaaggta ccgaattcaa aaccacgctg gccggtgccg acattcaggc aggtgtaggc 3900
gaaaaagccc gtgtcgatgc gaaaattatc ctcaaaggca ttgtgaaccg tatccagtcg 3960
gaagaaaaat tagaaaccaa ctcaaccgta tggcagaaac aggccggacg cggcagcact 4020
atcgaaacgc taaaactgcc cagcttcgaa agccctactc cgcccaaatt gtccgcaccc 4080
ggcggctata tcgtcgacat tccgaaaggc aatctgaaaa ccgaaatcga aaagctgtcc 4140
aaacagcccg agtatgccta tctgaaacag ctccaagtag cgaaaaacat caactggaat 4200
caggtgcagc ttgcttacga cagatgggac tacaaacagg agggcttaac cgaagcaggt 4260
gcggcgatta tcgcactggc cgttaccgtg gtcacctcag gcgcaggaac cggagccgta 4320
ttgggattaa acggtgcggc cgccgccgca accgatgcag cattcgcctc tttggccagc 4380
caggcttccg tatcgttcat caacaacaaa ggcgatgtcg gcaaaaccct gaaagagctg 4440

ggcagaagca gcacggtgaa aaatctggtg gttgccgccg ctaccgcagg cgtagccgac 4500 

aaaatcggcg cttcggcact gaacaatgtc agcgataagc agtggatcaa caacctgacc 4560 

gtcaacctag ccaafcgcggg cagtgccgca ctgattaata ccgccatcaa cggcggcagc 4620 

ctcaaagaca acttgggcga tgccgcactg ggtgcgatag tcagtaccgt acacggagaa 4680

gtagcgagca aaatcaaatt taatctcagc gaagactaca ttacccacaa gattgcccat 4740 

gccatagcgg gctgtgcggc agcggcggcg aataagggta agtgtcagga tggtgcgatc 4800 

ggtgcggctg tgggcgagat agtcggggag gctttgacaa acggcaaaaa tcctgccact 4860 

ttgacagcta aagaacgcga acagattttg gcatacagca aactggttgc cggtacggta 4920 

agcggtgtgg tcggcggcga tgtgaataca gcggcgaatg cggctaaagt cgcgattgaa 4980

aataacctat tatctcaaga agagtatgct cttagagaaa aattgatcaa aaaagccaaa 5040 

gggaaaggcc tattatcttt agattggggc agcctgaccg aacaagaggc aaggcagttt 5100 

atctatttga ttgagaaaga tcgatattct aatcaattgc ttgaccgata tcaaaaaaat 5160 

ccaagtagtt taaataatca agaaaaaaat attcttgcat attttattaa ccaaacctct 5220

ggaggtaaca cagcttgggc agcttcgata ctgaaaacgc cccagtcaat gggtaafcctc 5280

actattcctt ccaaagatat taataacacc ttatcgaaag cctatcaaac attgagtcgt 5340 

tatgattctt ttgattacaa atcagctgtt gccgcacaac ctgcacttta cttattaaac 5400 

ggaccgcttg gcttcagtgt caaagcagct actgtggcag caggaggata taacattgga 5460 

cagggagcga aagcaatctc taatggagaa tatctgcatg gtacagttca ggttgttaat 5520

ggcacattga tggttgcagg atctgtatct gcacaggctg caatatcggc caagcctgca 5580

cctgttaccc gttatctgag caatgacagt gctcctgctt taagacaagc tttaactgct 5640 

gaaagccaga gaatccgcat gaaactgccg gaagagtatc gacaaatagg gaatcttgcg 5700

atagcaaaaa ttgatgttaa aggattaccg caaaggatgg aagcatttag ttctttccaa 5760

aaaggggaac atggatttat ttcgttacct gaaacaaaaa tttttaaacc tatatctgtt 5820

gataaatatc ataatattgc ctctcctcct agaggaacat taagaaatat agatggagaa 5880

tataaattac ttgaaactat agcacagcaa ctcggaaata atcgtaatgt atcaggtaga 5940 

attgatctat ttacagaatt aaaggcctgt caatcttgca gcaatgttat tttagagttt 6000 

agaaatcgct atccaaatat tcaattaaat atttttacag gaaaatag 6048 
<212> Type : DNA 

<211> Length : 6048

SequenceName : SEQ ID 753
SequenceDescription : 






Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491
<400> PreSequenceString : 

atggacttaa tceaaacccc gaataagcaa tttgtcgacg gcgaccgccg cacgcccggt 60
actcccgtac ccgcatggtg gctgaaccag ttacaaggcg agttgtacag cattttaaac 120
gcggttggca ttgagcctaa caaagccgac catgcccaag tcttatcggc cattaaaacg 180
fctggccgccg atgcttcgca ggttgccagt atcgatgctc tgcgtaaata cagcggcaca 240
ggctatgtga acgtcaacgc ctatcacgcc aatacaacag tgggcggcgg cgtgtttgtg 300
gcggataaag ccgataaatc taccgctgat aacggctgta ccgttattgt ttctaccgac 360
ggcacgcgct ggaagcgtgt gttttcaggg atgcttaacc tgcatgattt tggatatgtg 420
gccagcaaaa acaatgcact atccactttg aatgccgctg aatctgccgc gcttgacgta 480
gttgttgatt gcttgggttt gtcaattgat acgggtaata tctacccgca aaaaaacaaa 540
tacacaaacg gcaagtttgt gattaacggc aaaactgtcg atgttcaata ccagcctatc 600
agaagcggta tcggtcgatt catctccgga actggtgcag cagccaacct caaatcgaat 660
gaatggactg gcgcgggttt aatcgttatt ggcgaaggcg caatggagca gatggagaaa 720
tgtgtttcct caatcgctat tggcgaccgt gcgcagggct tttctaaagt aagcagggac 780
aacatcgcca ttggggccga cagcctgatt aatgtgcagg ccgctactga atggtacgac 840
cagtcacgca tggaaggcac gcgcaacatc ggtattggtg gtaatgcagg acgcggcatc 900
accagcggtt actctaatgt gtcaatcggg cgcaatgccg gacagggatt gggtgaaggc 960
tcgtcaaata ttgcacttgg cgcaggcgcg atggctggta ctgctccagt cggttttagt 1020
ggcgacattg aagttttttg gccgtcttcg acctcaagaa caatcgcaat cggcgaggct 1080
gtcttgcaaa catatcaggg ccgcgccgct caaaccgcaa ttggtgccaa tgcggcgcga 1140
aatacaaaaa aggccgaaaa agttaccgca atcggttctg ccgcgatgga gaatcttgag 1200
cgaaaccgcg ccccaaatgg cggagatgtt gtctggacgg gaacggaagc aggtacctac 1260
gcccaatctg gaaaaaacat cacgcttaca tttcccaaca ttcgcggtgc gcaagcgact 1320
tattgggtgg gcatccgcct tacatcaggc acggcgcaaa ccttacaaaa cgacgtcgta 1380
ccggctcagg tcgtatcagt gaatggcaat acattaatca tccaaagctc aaaagagctg 1440
accgccaccg gcgcggccga actgaaatac gtttattctg taaattcaac cgctactaaa 1500
aacgaagagt tgaccatcat cggcgcgaac gccatgaata aggcattgac cgcaggatac 1560
tcaactatca tcggcgtaga tgccgcgttg ttgggagaca attatcaaaa aacaaccgca 1620
atcggcgcat catctttacg aacaggtagt catatttcca caactgctat tgggtattgg 1680
gtaatccctt tggcaagtag tgagaaatgt gttgccattg gagatagtgc gggctatcgg 1740
aacgttcaag gcgacttttt gactgggaaa ataacaaact ccatcgccat cggatatggc 1800
gcaagaataa acggcgataa cgaaatccaa atcggtacga cagggcaaac tttatatgct 1860
ccaaccgcgg tgaacatccg ttctgacggc cgcgacaaag cagatgttaa gccgttgacg 1920
aacggtttag attttgtaat gaagctcaag ccgatgactg gctactacga ccgccgggat 1980
tcctacgttg acgaattatt caaagacttg ccggcagatg aacgagcgga caaagtccgc 2040
gaatggtggg cgaatccaat caaggacggc agtcataaag aagatcggtt gcggcattgg 2100
tttattgccc aggacattgc tgcgctggaa gatgaatatg gtcgattgcc gatggtaaat 2160
aaaacaaacg atacctacac cgtcgaatac gaaacgttca tccccgtttt gactaaagcc 2220
attcaggaaa tggccgcaag aattgaaaca ttagaaaccg aaatgaagga atcgaaaaaa 2280
tga 2283
<212> Type : DNA
<211> Length : 2283

SequenceName : SEQ ID 754
SequenceDescription : 

Sequence



<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString :

atgaaaaata tctcacgcaa atgctttatg accagtgtgg tatgtattat tctaggaggc 60
attcttctag gggcfcggcta tgcaactgga ggacttcagg acattaaaca ccaaacagct 120
cccaaaaagg tcatcaaaac atttgaccaa ataactgctc ttgatattga cagttctgcc 180
tcaactatta cagtagagac aggacccgtt caaagaccaa cagtgaccta ttacacacat 240
cctaaattta ttgaccctat cgtcacaaca ttaacaggta agaccctatc tctttcgcaa 300
aaacccaaag acattgtcat cactggtgga attgaaattt taggttttac tctcaataat 360
agtcgtcaag agaagaacta ccgttctatt accattactg ttcctgaaaa aactagcctc 420
aatgaagtta agggaagtaa tgtcccacat accactttgt caaatctaac tgtccaagat 480
atgcaattcg atggcaatct tactctttta cataccaaag tcaagaaagc tactatcact 540
ggtatgttgg aagccactaa aagtcagtta acaaatctcg agttaaaagc tgactattct 600
ttttcaaacc tgactgattc tagtgtggaa aatgggacca tcagcttagg aaatggacaa 660
ctaactacta aagataccac tctaaaagcc atcaatattc aatcattaca ccctggcggt 720
atagaagccg agagaacaac ccttgaaaat gtgaccttca ctgtttctaa aagcaaagaa 780
gaagaagagg agaacgacta ctafcgacaat gatgctatct tcaccgctca tgcccttacc 840
cttaaaggga ctaatactat tagtggtggt gratattgatg ttgacataac cttgacaaaa 900
gcaaaggcca tcgcctacag ggcaaggacc gaaaatggta aagtctccct tggctcacag 960
ctgacaccag ctaagattgg taaggaatca acttcagatg ttatttctta tgtggctgag 1020
aataaagcag ctactggaaa tttaacggtt aatctcaata agggagacat tactatcaaa 1080
tga 1083
<212> Type : DNA
<211> Length : 1083

SequenceName : SEQ ID 755

SequenceDescription :



Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232
<400> PreSequenceString :

atgtttaaga aagaaaattt aaaacaacgt tattttaatt ttggattagt agcgttagct 60 

ctaacaatat tagccatcat ttttgccttc tcaagtaaaa atgctgatac taagtcttat 120 

gctaagaagt cagaaagtaa aatggtaaca atcgacaagg ctccaaaaaa taatcatgct 180 

attactaaag aagaaagcaa agaaaaagca aagagcattg cttcggagcc tattcccaca 240

gtagaaaact ctgtagctcc gacagtaaca gaggaagcac cggttgttca gcaagaagtg 300

actcaaactg ttcagcaggt atcttcagta gcctataatc caaataatgt ggtactttcc 360 

aatggaaata ctgctggtat tgtaggaagt caagcggcgg cacagatggc agcagcaaca 420 

ggtgttccac aatcaacttg ggaacatata attgcgcgtg aatctaatgg aaatcctaac 480 

gcagctaatg ctfcctggggc atcagggttg ttccagacaa tgccaggttg gggttctaca 540

gcaacggttg aagatcaagt caatgcagcc ttgaaagcct atagtgcaca aggtttatca 600 

gcttggggtt actaa 615 
<212> Type : DNA 
<211> Length : 615 

SequenceName : SEQ ID 756

SequenceDescription : 



Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232
<400> PreSequenceString : 

atgttagaag aattgaaaac acttattaaa aatccaaaat taatgattac aatgattggt 60
gtggccctag tgcctgcctt atataattta tcctttctag gctcaatgtg ggatccttat 120
ggtcgggtca atgaccttcc cattgctgtt gttaatcatg ataagccfcgc aaagagagct 180
gataagtcat tgacaattgg gaatgatatg gtggacaaga tgtctaaaag taaagattta 240
gactatcatt ttgtatcttc aaaaagtgct caaaaaggtc ttaaaaaagg tgattattat 300
atggtcatta ctttacccga agatctttct cagcgagcaa caaccttatt aaatcccgaa 360
ccccaaaaac taactatccg ttaccaaact agtaaaggac atggaatggt cgctgctaag 420
atgggggaaa cagcgatggc taagctgaaa gagtctgttt cgcaaaacat tacgaagact 480
tatacctcag cagtttttag cagtatgaca gacctccaat caggattaaa agaagcctca 540
actggtagtc aagcattgga ttcaggagcg aagacagccc aaatgggtag tcaaatgctc 600
tcagataact tagcaggttt atctagtgct agttggcaat ttcaacaagg aactaatcgt 660
ttaacttcag gattaacggc ctatacagct ggtgttagtc aagtaaagga cggactaggg 720
cagctctcaa ccgatatgcc agtttacctg aatggggttt ctcggttatc acaaggagct 780
tctcaactta atcagggtct ttcacagttg acacaatcaa caacactttc tgatgataaa 840
gctaaaagaa tccaatcttt ggaggtagga ttaccagttt taaatcaagg tattcagcaa 900
ctaaatgaaa atctctcaac gatgcaggta ccaaaactta ataccgatga gttagggaat 960
aacttggctg ccattgctca agctgcgcaa cagttacttg ttaaagaagc tgctgctcac 1020
aaggaacaac tagcagtctt acaagccact agcgcttatc agtcactaac tgcfcgagcaa 1080
caaggggaat taactgctgc tcttacccaa actgataagg gtgaagctgt ggctcctgct 1140
caaacgattt taaggtctgt tcaaactttg tcaacaagtt tacagtctct ctctcaagaa 1200
gatcagtcaa aacagttgga gcaacttaag gaagctgttg cacagattgc taatcaatcc 1260
aatrcaagctt tgccgggagc aagttctgct ttaactgaat tatcaacggg attagcaaag 1320
gtaaatggta gcttaaatca acaagttcta ccaggaagta atcaattgac aacaggatta 1380
gcacaattaa acaggtataa tactgccatt ggttctgggg taataaaact ctcagaaggt 1440
gccaatgcct tgtcatccaa gtccggagaa ttactagatg gtagccatca attafccagaa 1500
ggtgctacta aactagctga tggtagttct caattgagtc agggtggtca tcaattaacg 1560
agcggattga ctgaattatc aacaggattg tcaatcttaa atggttcctt agccaaagcc 1620
tctcagcagt tatcgcttgt ttctgtgact gataaaaatg ctaaagctgt cgcaaaacct 1680
cttgtgttaa atgagaaaga caaagatggt gttaagacga atgggatcgg gatggcacct 1740
tatatgattg ctgtttctct aatggttgtg gccctttcaa ccaacgtcat ttttgctaat 1800
tctttatctg gtcgtccggt caaagataaa tgggattggg ctaaacaaaa atttgttatt 1860
aatggtttta tttcgactat gggatccatt gttctctact tagctattca attattaggg 1920
tttgaagccc gttatggtat ggaaacctta ggatttatta tgctaagtgg ttggacgttt 1980
atggctcttg tcacagcttt ggtcggttgg gatgatcgat atggctcttt tgcttcfcttg 2040
gttatgttat tgcfctcaggt tggctcttca ggtggctctt accccattga gttaagtgga 2100
gcatttttcc aaaagttaca tcctttctta ccaatgactt atgtggtatc tggtttacga 2160
caaaccattt cattatcagg tcatattgga gtagaagtga aagtcttaac tggtttctta 2220
ctggcattta tggtattatc actactcatt tatcgtccca agaaaacagt ctaa 2274



<212> Type : DNA 
<211> Length : 2274

SequenceName : SEQ ID 757 
SequenceDescription : 






Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

atgagcagag acccaacata tacaataaac gagcacgact tatcttttgc agatggtcgt 60 

ttttatgtga cctttaaggc agataagtca agtgagactg tgagacttaa cagtagttgc 120

cttggcaata ccataatcaa aaagctacag gtcgaggatg acaatacaat gcacgacttt 180

gtaaagccta aagttaccac tcaacaagct tttggactag ctcagcaggt caaagagctt 240

gatttacagc taaaagaccc taagtcagat ttgtggggca aaatcaagtt caataataag 300

gcaatgctag tcgagtacgc caacaaagag atgtcaagtg ccattgcgca atcagctgag 360 

cagatattgt tacaagtcaa gtctattgat gatgaacgat attccaaatt tgagcaaact 420 

ctgaatggta tcaaacaaac tgtcaaaagt gagtcagttg aatccgcacg tactcagcta 480

gcatcaatgt ttgatagtcg tattagtgga cttgatggca aatacagtcg tttgagccaa 540 

acaattgata gtcttagcag tcgtcttgat gatggtgttg gtaactactc aacgctatct 600 

caaaaggtaa gtggcattga tttacgagtt agtaatgcag ctaatgatgt ttctcgattg 660 

tctcagacag cacaaggatt gcagtcacaa atcacaaatg caaaccaaaa ttacagcagt 720 

ttgtctcaga ctgtacaggg actacaaaca actgtacgtg ataatcaatc aaatgctaca 780

agtcggatta atcagttaag tgacctgatc agcacaaagg tatcaaaagg tgatgttgag 840

acaactattg ctcagagtta cgacaagata gccttcgcaa tcagggataa actcccagca 900

agcaaaatgt ctggcagtga gattatctcg gcaatcaatc ttgataggtc tggggttaaa 960

atcactggaa aaaatatcac tcttgatggt aacagctaca tcagcaacgc tgtcatcaaa 1020

gatgctcata ttgctaacat ggatgccggt aagattaata ctggttatct taatgctaat 1080

aggattgcaa ccgaggccat tactggtgag aaaattaaga tggactatgc cttttttaat 1140

aaactcactg ctaacgaggg atattttagg acgttgtttg ccaaagacat ctttgcaaca 1200 
tcagtccaat ctgtaacact atcagctagc aaaattactg gaggtgtatfc agccgctaca 1260

aatggggcaa gtcagtggga cctaaataat gccaatatga cctfctaatcg agatgccaca 1320

attaatttta atagcaaaaa caatgcctta gtacgtaaag atggcacaca tactgctttt 1380 

gtacatttta gtaatgccac accaaaaggc tatagaggct cagcgttgta tgcctctatc 1440 

ggcattacct catcaggaga tggcatcgac agcgcttctt ccggtcgttt tgcagggcta 1500 

aggtcattta ggtacgctac gggatataac catactgctg cagtcgacca aaccgagcta 1560 

tacggtgata atgtcttgat tgcagatgac tttagcatca atcgaggatt taaatttaga 1620 

ccagacaaaa tggaaaaagt gctcgacatg aacgacttgt atgcggctgt agtagcctta 1680 

ggccgctgtt gggggcactt ggctaacgtc ggctggaata ctgctcatag caattttaca 1740 

agtgctgtga gtagggaatt gaataactac atcactaaaa tttaa 1785 
<212> Type : DNA 
<211> Length : 1785 

SequenceName : SEQ ID 758 

SequenceDescription :

Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

atggcagctg atggtaaggt aacgatactt gttgacgttg atggtaagca ggtaaaggta 60 

ctcaatagtg agttagataa agttgccaag cacggtgaca aaggcagctc ctctcttaaa 120 

aaatttgcgg ttggtgcagg agtctttaaa ttagcttcgg ctgcagttga tttggttagt 180 

caatctcttg gcaaggctat cacaagattt gacacgcttg aaaaatatcc aagggtcatg 240 

aaagctatgg ggcatagcgc tgaggatgtt gctagatcaa ctgataagtt agcgaacgga 300

attgatggac taccaacaac tttagacgag gttgtcggaa ccgctcaacg tttgacctct 360 

attactaagg atatcaataa atcaactaat ctcacactag cattaaataa tgccttttta 420 

gcttcaggag cttcatcaga ggctgcaagc cgagggctgg agcaatatgc ccaaatgcta 480 

tcagctggta aggttgatat gcaagcttgg aaaaccctcc aagaaacaat gccttatgcc 540 

ttacaacaaa ctgcggaagc ttttggattt gcaggggcat cggctcaaaa ggatttttat 600 

gaggcgttaa aaaacgggca aataacattt gaccaatttt ctaataagtt gattgagtta 660 

aafcgatggtg tcggcggttt tgcagaacta gccaaagaaa atagtaaagg gattgaaacc 720 

tcttttaaca acatcaagaa cgctattgca aaaggtgtgg ccaatagcat taaggctttg 780 

gatgatttgt ctaaggctgc aacaggtaag ggcatagctg atcattttga tagtttgaaa 840 

gttgttatca atgcctcttt tagcgccatc aatgcaagta ttaaagctag tacaccgcta 900

tttaaacttt tgtttagtgt tattggtgct ggaatatcag tcgtcaaagc tctgtcgcct 960 

gccttagttg gtgtagcatc tggtctagct gccatgaggg cagttaatga gactataaca 1020

atgattaaag cgctaaatag agcttgggtt atggcatctg catcaatgag tattggagca 1080 

acaaccatta agactgtgac tgcggtacaa gcggtaagta ccacgatgac taaagcagat 1140 

atggttgcaa gactatctca gttaggtgtc ttaaaagcca gtaccgtgat ttatggtgtt 1200 

atgacaggcg ctatcagttt atctactgct gcaaccatag ccagtactgc tgcggtaact 1260 

gcgttaaaag cagcacttgt agccttaaca ggtcccgttg gttgggtagt tggagctatc 1320 

ggtgctttag ttgctgtcgg agtaagctta tggtcatggc taactaaaga gtcagacgag 1380 

accaaaaagc tgaaaaaaga gcaggagggg ctagtcgaaa gcaacaaaca gctaagagat 1440 

tctgtccgtg agggcgtgca agagcgtaag aagggccttg agtccgtcaa agagagcact 1500 

gcagctcatc aaaaattagc tgacgaaatc attaagttag ccgccaaaga aaacaaaact 1560 

gcaggcgaaa aacaaaactt aaaaaataag attgatcagc ttaatgggtc tattgatggc 1620

ttaaacttgg cctatgacaa aaactccaat tctctttctc acaatgcaga tcaaattaag 1680 

tcacgcatta gtgccatgga agcagaaagc acatggcaaa cagcacaaca aaatctgtta 1740 

aatattgaac agaaacgtag cgaggttagc aaaaagctag ctgaaaatgc tgatttgcgt 1800 

aaaaagtgga atgaagaagc taacgtctcc gattccgtcc gaaaagaaaa gattgcagaa 1860 

ctcacagaag aagaagctaa acttaaaaat atgcagactc aactgcagga ggagtataac 1920

aagacatcag ctactcaaca agctgctgca gacgctatgg ctgccgcfcga agaatcagga 1980 

tcagcaagac aggttatagc gtacgaaaat atgtcagaag ctcaacgaac tgccatagac 2040 

aatatgcgca ctaagtactc tgaactttta gagacaacga catctatttt tgatgctata 2100 

gaacaaaaga cggcattatc agtagatcaa atgaatacca accttgaaaa aaatagagct 2160 

gctactgaac agtgggctac taatttggag attttggctc agcgtggtgt ggatcaaggt 2220 

attttggagc aactaagacg catgggtcct gaaggagcca cacagacgca agtttttgtg 2280 

gatgcaacag atgccgagct agcacccttg caggaaaact ttagagcagc cacagaaact 2340 

gctaaaaatg caatggggag cgttttagac tcagcaggtg tggaaatgcc agaaaaagtt 2400 

aaagggatgg tcactaatgt ttctacggga ttacaggcgg aactgcaagc tgctaacttt 2460 

gctcaacttg gccaagaaat ccctaatggg gtttctcaag gtataagtca aggggcaggt 2520 

aaagcaagtg acgcaagtgt caaaatgggt caagaagtta aacgctcttt tcaaggagag 2580 

ttgggtatcc actcgccatc gcgagtattt actgagtacg gtggccatat tactgatggc 2640 

ttgagtaatg gtgtgacaaa tggaacgtca aaagttatgc aaaccatgca gagcttggct 2700 

caacagatgt ctcaaaaagg acagcagatt gttaatgaca tgcgtagcaa gtcgaaccaa 2760 

atcacagatg cttttagcac gatgagtggt ccaatgcact ctcatggtgt taatgccatg 2820 

caaggtttgg ccaatggtat ttatgcaggg tcgggggcag ctttagcggc agctcaaagc 2880 
attgcggcac gtatcaccgc aacaattcaa agtgccttag atatccactc gccatctcgt 2940

gttatgaggg atgaggttgg acgttttatc cctcagggta tcgctgtagg tattgatgcg 3000

gatagaaaag tcattgactc atctatgcaa aagctaaaag agtcaatgac gattaatgcg 3060

actccagaaa tagcctctgg atttggcgga ggagttgcgg ggattgctaa tcagaccaca 3120 

aataactcaa ataacagttt tacccttaat gtcaaggttg atgaatccga cggfcaatagc 3180

cacgagaaat atcaacgctt attcagagaa tttagctggt atattcaaca acaacaagga 3240 

aggttaggtg atgttaaatg a 3261 
<212> Type : DNA 
<211> Length : 3261
SequenceName : SEQ ID 759

SequenceDescription : 



Sequence 

<213> OrganismName : Streptococcus pyogenes strain MGAS8232
<400> PreSequenceString :

atggctaaag aaccatggga agaaaaaatt gttgatgata ctatagggac acgaacacgt 60
aaatcaagaa atgctttcat tagcacgcct tggttgactg ctttattaag tgtattcttt 120
gtcatcattg ttgctatact ttttattttc ttctatacat caaatagcgg tagtaataga 180
caagctgaaa caaatggttt ttatggagca tccactcata aaaaaacaag gaaagcttct 240
aacgctaaaa aaacatcaag tagttcaaca actacagaca caacaccttc tagcgaagaa 300
acacttgctt ctagtgaagg aaccggcgaa acccttactg tattggcagg tgagggggca 360
gcttctattg cagctcgtgc aggtatttct gtggagcagt tacaagcact taatccagag 420
cacatgactc aaggatattg gtatgccaat ccaggagatc aagtcactat taaataa 477


<212> Type : DNA 
<211> Length : 477 

SequenceName : SEQ ID 760 

SequenceDescription :


Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232
<400> PreSequenceString :

atgtctaaaa gaggaaaaat taaaataaca acgaaaacaa agcttattac agctagtgtt 60

ataacgctag tattaattat aactggagta gtcttgtgga aacaacaaca aaatacgcta 120

acagctgata tcgctaaaga accttactct actgttagtg taactgaagg gagtattgct 180 

tcttcgactt tactatcagg tactgtaaag gctttatcag aggaatatat ttattttgat 240 

gctaataaag gaaatgatgc aactgttaca gttaaaatag gtgatcaggt aacgcagggc 300 

cagcaattag ttcaatataa tacaacaaca gctcagtcag cttatgatac tgctgttagg 360

agtcttaaca agattggccg acaaattaat catcttaaaa catacggagt tcctgctgtt 420 

agtacagaaa ctaataaaga tgaagctacc ggtgaagaga cgacaacaac agttcaacca 480

tcagctcagc aaaatgctaa ttataaacag cagctgcaag atttaaatga tgcttatgca 540 

gatgcacaag cagaagtaaa taaagcgcag atagcgttaa atgatacagt agttatcagt 600

agtgtctctg gaactgttgt ggaagtaaat aatgatattg atccttcttc aaagaacagt 660

caaacacttg ttcacgtagc aaccgaagga cagcttcagg tgaaaggaac attgacagag 720

tatgatttag caaacgttaa ggttggtcaa tctgtaaaaa ttaagtctaa agtttattct 780 

aatcaagaat ggactggaaa aatatcatat gtttcaaact atccaactga gtctaatgca 840 

ggttcaacaa cgccagcagg tagcactgga gcgggaagtt ctacaggagc tgcctatgat 900 

tacaagattg atattataag tcctcttaac cagcttaaac aaggtttcac tgtttctgtt 960

gaggttgtta atgaagctaa acaggcctta gttcctttaa cggctgttat taagaaagat 1020

aaaaaacact atgtttggac ttatgatgat gctactggca aagccaaaaa agtagaggtg 1080 

acacttggaa acgcagatgc acaacaacaa gaaattcata aaggagtagc tgttggtgac 1140 

attgttattg ccaatccaga taaaaatatc aaaccggata aaaaactaga aggggttatt 1200

tcaataggta ccaacacaaa accggaaaaa gattctcaat caaagaataa aaaatcaggg 1260

gtggataaat ag 1272 
<212> Type : DNA 
<211> Length : 1272 

SequenceName : SEQ ID 761 



SequenceDescription :



Sequence 



<213> OrganismName : Treponema pallidum subsp. pallidum str. Nichols 
<400> PreSequenceString :

gtgctccgct tgccaactgc gcgcgcgtgc attacaatgg gcaccatgat cagacataca 60 
tttacgcata ggtgcggagc gctcctgtgc gcgctggcgt tgggaagctc cactatggct 120 
gcgaccgccg ctgcaaaacc caagaaaggg caaatgcaga aattgcggca gcggccggtg 180

tgggcgccca ccggcgggcg gtatgcgtct ttggacggtg cgtttaccgc gctggcaaat 240 

gatgcaagtt tctttgaggc aaatccggca ggaagtgcga acatgacgca cggggagctg 300

gctttcttcc ataccactgg ctttggctcg tttcacgccg aaacgctctc ttacgttggc 360

cagtcgggca actggggata cggcgcgtcg atgcgtatgt ttttccctga atctgggttt 420

gacttttcta ccaccacgga gcccgtgtgc acacctgctt cgaaccccat taagcagcgc 480 

ggggcaattg gaatcatcaa ctttgcccgg cgtatcggag gtctctccct gggagccaac 540 

ctgaaggcgg ggttccgcga cgcgcagggc ctgcagcaca cctctgtctc cagtgacatc 600 

ggcttgcagt gggtggggaa cgttgccaag tcctttacct ctgaagagcc caacctgtac 660 

atcgggcttg cggccaccaa cttgggattg accgtaaagg tctcggacaa gatagagaac 720

tgcacgagta cctgtgaaaa gtgtggttgc tgcaaggaga ggtgctgctg caacggcaag 780

aaggcgtgct gcaaggactg cgactgtaac tgcccctgtc aggactgcaa cgacaaaggt 840 

acggtgcacg caacagacac catgctgcgt gcagggtttg cataccggcc cttcagctgg 900

ttcctcttta gccttggtgc caccaccagc atgaatgtgc agaccttggc tagtagtgac 960

gccaagtcgc tgtaccagaa cctggcttac agcataggcg ccatgtttga tcccttcagc 1020

ttcctgagct tgagttcgag cttccgcatc aaccacaagg ctaacatgcg agtgggagtg 1080

ggtgcagagg cgcgcattgc ccgcattaag ctgaacgcgg gataccgctg tgacgtcagc 1140 

gacatcagca gtgggagtgg gtgcacaggc gcgaaggctt cgcactacct ttccttgggt 1200 

ggcgcgatac tgctcggccg aaattaa 1227 

<212> Type : DNA

<211> Length : 1227 

SequenceName : SEQ ID 762 
SequenceDescription : 



Sequence



<213> OrganismName : Treponema pallidum subsp. pallidum str. Nichols 



<400> PreSequenceString : 

atgagcagaa cgttccgcgc gtggcagtgc gttggtgcgc tgtgtgcgct ctctcccctg 60 

ctgcctgcct acagctccga gggcgtgcga gaggtacccc cctcccagtc tccgcaggtg 120

gtggtggcgt acgagcccat tcgccccggg gatcagctgc tcaaaattgg cattgttgca 180

ggctgccagt tgtacatagc agggggaaat ggaaccaacg gctcttcgag ttccggcacc 240 

aacggtaacg gcaacggcaa actgctcggg ggcggggggt ttcacctcgg gtacgagtat 300

ttttttacca aaaacttttc cctcggcggg caagtttcct ttgagtgtta ccgcacgacc 360 

gggtcaaact attacttttc tgttcccatc acggtaaacc ccacgtacac gtttgccgta 420

gggcgctggc gcataccgct ctccctgggc gttgggctca acattcagtc ctatctcagc 480

aagaaggcgc cggggcttat tgcggaagcc agcgcggggc tctactacca gtacaccccg 540

gactggtcca tcggcggcat tgttgcctac acgcagcttg gggacattgc aagctccccc 600

gacaagtgca gagccgtggg ccttgccacc attgactttg gggtgcgcta tcacttttag 660




<212> Type : DNA 

<211> Length : 660 

SequenceName : SEQ ID 763 
SequenceDescription : 

(57) Abstract: A computational method for identifying adhesin and adhesin-like proteins, said method comprising the steps of
computing sequence-based attributes for neural network software, wherein the attributes are (i) amino acid frequencies, (ii) multiplet
frequency, (iii) dipeptide frequencies, (iv) charge composition, and (v) hydrophobic composition; training an artificial neural
network (ANN) for each of the five computed attributes; and identifying the adhesin and adhesin-like proteins as those having a
probability of being an adhesin (P_ad) > 0.51; a computer system for performing the method; and genes and proteins encoding adhesin
and adhesin-like proteins.
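
The attribute computation named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the residue groupings used for charge and hydrophobicity, and the simple fractional definitions are all assumptions made for illustration, and the multiplet attribute and the neural networks themselves are omitted because this section does not define them. Only the 0.51 cutoff is taken from the abstract.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Assumed residue groupings (illustrative only; not specified in the abstract).
POSITIVE = set("KR")
NEGATIVE = set("DE")
HYDROPHOBIC = set("AILMFVWC")

# Cutoff stated in the abstract: a protein is reported when P_ad > 0.51.
P_AD_THRESHOLD = 0.51


def _clean(seq):
    """Upper-case the sequence and reject empty input."""
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty protein sequence")
    return seq


def amino_acid_frequencies(seq):
    """Fraction of the sequence made up by each of the 20 standard residues."""
    seq = _clean(seq)
    counts = Counter(seq)
    return {aa: counts.get(aa, 0) / len(seq) for aa in AMINO_ACIDS}


def dipeptide_frequencies(seq):
    """Fraction of overlapping residue pairs for each of the 400 dipeptides."""
    seq = _clean(seq)
    pairs = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    total = max(len(seq) - 1, 1)
    return {a + b: pairs.get(a + b, 0) / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def charge_composition(seq):
    """Fractions of positively and negatively charged residues (assumed sets)."""
    seq = _clean(seq)
    return {
        "positive": sum(r in POSITIVE for r in seq) / len(seq),
        "negative": sum(r in NEGATIVE for r in seq) / len(seq),
    }


def hydrophobic_composition(seq):
    """Fraction of residues in the assumed hydrophobic set."""
    seq = _clean(seq)
    return sum(r in HYDROPHOBIC for r in seq) / len(seq)


def is_predicted_adhesin(p_ad):
    """Apply the strict P_ad > 0.51 cutoff from the abstract."""
    return p_ad > P_AD_THRESHOLD
```

Each function returns a fixed-length feature (20 values, 400 values, 2 values, 1 value), which is the shape a per-attribute network would need as input. How the per-attribute network outputs are combined into a single P_ad is not described in this section, so `is_predicted_adhesin` takes that probability as given.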